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PREFATORY  NOTE 

When  Professor  Heitler  invited  me  a  few  years  ago  to  write  a  book  on  cosmic 
ray  theory,  I  accepted  it  with  a  modest  purpose  in  view — to  present  a  theoretical 
account  of  various  phenomena  associated  with  cosmic  radiation.  Since  such 
phenomena  included  interactions  of  elementary  particles  and  their  production 
at  the  place  of  their  origin  and  in  their  passage  through  matter,  it  was  thought 
necessary  to  start  the  book  with  an  introduction  dealing  with  quantum 
mechanical  collision  processes.  The  writing  of  this  introduction  slowly  inveigled 
me  into  the  fascinating  domain  of  fundamental  physics  and  the  scope  of  the  book 
gradually  became  enlarged.  The  introduction  thereby  stretched  to  two-thirds 
the  span  of  the  book  till  the  role  of  the  introductory  and  substantial  parts  became 
reversed.  It  now  professes  to  be  a  book  of  elementary  particles,  their  interactions 
being  described  through  cosmic  ray  phenomena. 

The  main  stimulus  for  such  a  transformation  came  through  my  close  association 
with  leading  physicists  abroad  during  my  stay  at  the  Yukawa  Hall,  Japan,  at  the 
kind  invitation  of  Professor  Yukawa  under  the  auspices  of  the  Asia  Foundation,  and 
the  following  visit  to  the  various  centres  of  fundamental  research  in  experimental 
and  theoretical  physics  in  the  United  States  and  Europe.  My  next  visit  to  the 
United  States,  again  with  the  assistance  of  the  Asia  Foundation,  and  my  stay  at  the 
Institute  for  Advanced  Study  where  I  enjoyed  the  hospitality  of  Professor  Oppen- 
heimer  brought  me  almost  "face "to  face"  with  some  of  the  creators  of  modern 
physics,  especially  at  the  time  when  the  prediction  and  demonstration  of  non- 
conservation  of  parity  was  attracting  the  attention  of  physicists  throughout  the 
world.  While  of  course  by  the  end  of  my  visit  to  Princeton  I  was  desirous  of 
shifting  the  emphasis  of  the  book  to  elementary  particles,  I  would  not  have  under- 
taken the  task  but  for  the  willing  and  enthusiastic  co-operation  of  Mr.  N.  R. 
Ranganathan,  one  of  the  most  promising  young  members  of  the  theoretical  physics 
group  at  Madras.  Other  members  of  the  group  also  convinced  me  not  merely 
of  the  necessity  of  writing  a  book  with  an  essentially  physical  approach 
to  problems  involving  complex  and  formal  mathematical  methods  but  also  of 
its  possibility. 

It  is  a  pleasure  to  acknowledge  in  some  detail  the  help  rendered  to  me 
by  my  young  students  in  the  writing  of  this  book;  Mr.  P.  Rajagopal  in 
regard  to  the  sections  on  single  particle  wave  functions  and  wave  equations, 
Mr.  A.  P.  Balachandran  on  self  energy  and  renormalization,  Dr.  R.  Vasudevan 
and  Miss  T.  K.  Radha  on  Feynman  processes,  Mr.  G.  Ramachandran  on  primary 
cosmic  radiation.  Dr.  S.  K.  Srinivasan  and  Miss  Thunga  on  geomagnetic  effects, 
Mr.  N.  R.  Ranganathan  on   new  particles,  Mr.  K.  Venkatesan  on  meson  pro- 
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cesses  and  the  Heisenberg  representation,  Miss  S.  Indumathi  on  cascade  pro- 
cesses, Miss  G.  Bhamathi  on  hyperfragments  and  Mr.  V.  Devanathan  on  angular 
momentum. 

Four  of  my  students,  Mr.  A.  P.  Balachandran,  Miss  S.  Indumathi,  Miss 
G.  Bhamathi  and  Miss  R.  Thunga  scrutinized  the  manuscript  with  critical  care 
and  I  enjoyed  the  frank  and  free  discussions  we  had  on  the  subject  matter  of 
each  chapter.  I  am  particularly  grateful  to  Miss  T.  K.  Radha  and  Mr. 
P.  Rajagopal  for  undertaking  the  arduous  and  responsible  task  of  carefully 
preparing  the  manuscript  with  equations,  graphs  and  tables  for  publication, 
and  my  assistant  Mr.  E.  T.  Nambi  Iyengar  for  typing  the  book,  a  task  rendered 
rather  tiresome  in  view  of  the  constant  changes  that  had  to  be  made  before 
publication. 

The  tenor  and  style  of  the  book  have  naturally  been  determined  by  the  parti- 
cular background  of  my  scientific  education  under  my  great  teachers  Professor 
H.  J.  Bhabha  who  first  initiated  me  into  theoretical  physics  and  Professor 
M.  S.  Bartlett  who  taught  me  the  methods  of  probability  and  stochastic  theory. 
From  both  of  them  I  have  attempted  to  learn  one  valuable  lesson — that  it 
is  advisable  for  a  scientist,  particularly  a  theoretical  physicist,  to  combine  initiative 
with  caution.  This  book  is  the  product  of  such  an  effort.  In  presenting  established 
results  and  well-known  theories  I  have  taken  care  to  adhere  closely  to  the  treat- 
ments of  the  authors  themselves.  In  doing  this,  I  had  to  make  demands  on  the 
generosity  of  the  authors,  in  particular.  Professors  Yang,  Feynman,  P.T.Matthews, 
Jauch,  Chew  and  Kallen  and  their  publishers  who  have  without  hesitation  per- 
mitted me  to  draw  freely  from  their  work.  I  am  grateful  to  them,  to  the  Office  of 
the  Technical  Services  and  to  the  Addison- Wesley  Publishers  for  such  kind 
permission. 

A  rather  detailed  description  of  strange  particle  interactions  is  given  in  two 
chapters  to  illustrate  the  complexity  of  the  maze  which  is  puzzling  even  the 
most  creative  minds  in  physics  today.  The  nature  of  this  challenge  can  be  realized 
only  by  the  wild  and  desperate  attempts  that  are  being  made  to  discern  "a  plan 
behind  the  maze".  Part  I  concludes  with  a  summary  of  such  current  ideas  and 
shifting  speculations. 

The  exposition  in  the  rest  of  the  book,  particularly  relating  to  the  theory  of 
scattering  and  perturbation  expansions  in  Part  I  and  Cascade  Theory  in  Part  II, 
is  essentially  influenced  by  my  leanings  towards  the  theory  of  probabiHty  and  the 
mathematical  methods  associated  with  it.  The  kernel  function  formalism  of  Feyn- 
man and  its  relation  to  field  theory  have  been  described  from  a  point  of  view  and 
in  a  sequence  different  from  the  conventional;  the  old  wine  has  been  placed  in 
new  bottles  mainly  as  a  tribute  to  the  delectable  vintage  which  has  become  the 
stock-in-trade  of  the  theoretical  physicist  today. 

Cascade  theory  has  been  presented  in  a  portmanteau  form  made  possible 
only  by  recent  methods  and  techniques  developed  in  stochastic  theory;  the  descrip- 
tion of  geomagnetic  effects  has  been  condensed  without  omitting  the  intricate 


) 


PREFATORY     NOTE  XV 

details  which  alone  can  reveal  the  exquisite  beauty  of  the  geometrical  pattern 
of  the  motion  of  charged  particles  in  a  magnetic  field. 

Like  the  postscript  of  a  letter,  some  of  the  appendices  deal  with  my  own  ideas 
which  I  was  hesitant  to  include  in  the  main  body  of  the  book  as  they  relate  to 
theories  which  are  considered  too  well  established  to  require  criticism.  While 
I  am  aware  that  a  "blue-blooded"  theoretical  physicist  will  be  reluctant  to 
accept  the  intrusion  of  "commonplace"  probability  theory,  I  am  encouraged 
to  discuss  the  role  of  probability  in  quantum  mechanics  since  it  seems  to  have 
engaged  the  active  attention  of  Professor  Feynman,  one  of  the  creators  of  modern 
quantum  electrodynamics.  I  have  interpreted  many  of  his  casual  remarks  in 
his  papers  to  imply  that  all  has  not  been  said  on  the  Pauli  principle. 

I  became  more  conscious  of  my  shortcomings  after  the  completion  of 
the  book  than  at  its  commencement.  I  have  not  attempted  to  lead  the  reader 
but  preferred  to  walk  by  his  side  to  the  base  of  the  pyramid  of  modern 
physics  which  is  too  mighty  for  single  minds,  save  the  most  gifted,  even  for 
contemplation. 

I  do  not  know  whether  it  is  wisdom  or  impertinence  to  write  a  book  on  the 
theory  of  elementary  particles  and  cosmic  rays  at  a  time  when  modern  physics 
is  offering  its  greatest  challenge  since  the  birth  of  relativity  and  quantum  mecha- 
nics. But  from  a  humbler  pedagogic  point  of  view,  I  could  not  have  chosen  a  more 
propitious  occasion  for  its  publication,  to  meet,  in  a  small  measure  at  least,  the 
needs  of  scientific  education  in  my  country.  This  book  is  just  the  projection  of 
the  newly  awakened  hopes  and  aspirations  of  the  young  scientific  community 
in  my  home  town  and  university  and  its  nascent  interest  in  mathematical  sciences. 
It  is  gratifying  that  the  efforts  of  this  enterprising  group  have  attracted  the 
sympathetic  attention  of  Professor  Niels  Bohr  and  Professor  Abdus  Salam  who 
convinced  me  that  the  writing  of  this  book  was  a  pedagogic  necessity  in  my 
country  especially  at  the  inception  of  advanced  scientific  effort  on  an  organized 
scale. 

The  writing  of  any  text  book  is  considered  a  drudgery  but  to  me  it  has  been 
a  most  profitable  and  exciting  experience,  made  more  so  because  of  the  peculiar 
and  exhilarating  circumstance  that  every  chapter  of  this  book  was  discussed 
at  informal  meetings  with  my  students  in  my  family  home  in  an  atmosphere 
of  leisure  and  comfort.  On  this  occasion  I  wish  to  express  my  gratitude  to  the 
distinguished  Vice-Chancellor  of  our  University,  Dr.  A.  L.  Mudaliar,  who  was  kind 
enough  to  recognize  the  work  of  this  group  and  to  the  farsighted  Finance  Minister, 
Mr.  C.  Subrahmaniam  for  the  creation  of  the  Institute  of  Mathematical  Sciences 
to  provide  adequate  facilities  for  future  effort. 

It  has  been  a  particular  pleasure  and  privilege  to  me  that  Professor  Heitler, 
one  of  the  leading  theoretical  physicists  in  Europe,  has  been  the  main  sponsor  of 
this  book.  The  manuscript  was  revised  in  the  light  of  his  detailed  comments  given 
with  such  amiable  candour  and  friendly  interest,  and  I  cannot  adequately  express 
my  gratitude  to  him  for  his  kindness  to  the  point  of  leniency  in  allowing  me  so 
much  time  for  the  preparation  of  the  book  and  latitude  in  enlarging  its  scope. 
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I  am  grateful  to  the  Pergamon  Press  for  undertaking  the  publication  of  this 
work  and  to  Captain  I.  R.  Maxwell,  its  Managing  Director,  for  tolerating  the 
almost  inexcusable  delay  in  submitting  the  manuscript.  The  delay  has  placed 
a  greater  responsibility  on  me  since  this  book  will  have  to  deserve  to  be  a  sequel 
to  two  important  works  published  by  the  Pergamon  Press  on  theoretical  astro- 
physics and  experimental  work  on  elementary  particles.  Therefore,  before  committ- 
ing it  to  publication  I  was  anxious  to  present  its  contents  through  lectures,  and 
such  an  opportunity  arose  when  Professors  A.  Mercier  and  S.  Fliigge  invited 
me  to  Berne  and  Marburg  respectively.  I  shall  consider  my  efforts  worthwhile 
if  this  book  helps  a  new  entrant  to  realize  the  depth  and  extent  of  the  great 
stir  that  is  taking  place  in  physical  science  today. 

Alladi  Ramakrishnan 
'Ekamra  Nivas\ 
27  Luz  Church  Road, 
Madras- 4,  India. 


Part  One 
ELEMENTARY  PARTICLES 


''A  mighty  maze,  but  not  without  a  plan" 

A.  Pope 


CHAPTER  I 

SINGLE  PARTICLE  WAVE  FUNCTIONS' 
AND  WAVE  EQUATIONS 

1.  BASIC  POSTULATES  OF  WAVE  MECHANICS 

t 

Concept  of  the  wave  function  and  the  Schrodinger  equation 

In  classical  physics  an  elementary  particle  in  the  absence  of  interaction  is 
characterized  only  by  one  "intrinsic"  attribute,  its  mass 2.  Other  properties  are 
ascribed  when  we  consider  its  interaction  with  fields  of  force.  For  example, 
charge  can  be  regarded  as  a  constant  characteristic  of  the  interaction  of  the 
particle  with  an  electromagnetic  field.  Dynamical  attributes  like  energy  and 
momentum  are  not  "intrinsic"  in  the  sense  of  mass  and  charge.  The  variations 
of  these  dynamical  attributes  with  time  and  the  relations  between  them  are 
given  by  the  equations  of  motion. 

In  quantum  mechanics  a  free  or  interacting  particle  is  described  by  what  is 
known  as  its  wave  function  which  is  a  function  of  its  "intrinsic"  attributes  and 
the  dynamical  variables.  Let  us  ignore  for  the  moment  the  concept  of  spin  which 
is  of  course  an  intrinsic  quantum  mechanical  attribute.  It  is  customary  to 
represent  the  particle  by  a  complex  function  ip  of  its  spatial  and  temporal  co- 
ordinates (x,  y,  z,  t)  or  by  a  complex  function  99  of  its  momentum  and  time  co- 
ordinates {Px,  Py,  Pz^i)-  These  wave  functions  in  configuration  and  momentum 
spaces^  have  a  probabilistic  interpretation.  ip*y)  d^x  represents  the  probability 

1  W.  Pauli,  Handhudi  der  Physik,  Vol.5,  Springer-Verlag  (1958).  / 

^  In  this  point  of  view  we  take  mass  as  the  rest  energy  of  the  particle  and  not  as  the  constant 

characteristic  of  the  interaction  with  the  gravitational  field.  The  question  of  the  origin  of 

this  rest  energy  is  of  course  fundamental  and  attempts  have  been  made  to  interpret  this 

energy  as  arising  due  to  the  interaction  with  the  self  field. 

^  The  following  notation  will  be  used  as  regards  space-time  points  x,  y,  z,  t:  x  denotes 

the  space-time  point  with  components  x,  y,  z  and  ^.  a;  is  a  space  point  with  components  x,  y,  z. 

In  a  corresponding  manner  a  four  vector  a  has  components  (x^,  (Xy,  <x^  and  (x^  and  cc  represents 

a  three- vector  with  oc^,  ocy,  oc^  as  its  components.  The  scalar  product  of  two  three-vectors  cc 

and  ^  is: 

a-  ^=(xj,+  (Xy^y  +  oij, 

but  for  four-vectors  we  define  the  scalar  product  as 

a  •  /5  =  oiiPt  —  ocj^  —  (Xy^y  -  xj, . 
We  also  use  the  notation 

d*x  =  dxdydzdt     and     d^x^dxdydz. 

Under  normal  circumstances,  indices  1,  2,  are  used  to  distinguish  between  two  four- vectors 
and  not  for  their  components.  Sometimes  it  is  more  convenient  to  use  1,  2,  3,  4  instead  of  a;, 

1*  3 
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that  the  particle  is  found  in  the  volume  element  d^x  while  (p*(p  d^p  represents 
the  probability  that  the  particle  lies  in  the  momentum  volume  element  d*^. 
The  quantum  mechanical  description  of  a  particle  in  terms  of  itS  wave  function 
is  based  on  two  fundamental  principles,  those  of  complementarity  and  super- 
position. 

The  principle  of  complementarity 

Without  going  into  any  discussions  on  the  epistemological  basis  of  comple- 
mentarity, the  principle  in  relation  to  wave  functions  can  be  stated  as  follows. 

A  particle  can  be  described  either  by  its  wave  function  in  configuration  space 
or  in  momentum  space,  but  not  by  a  function  of  both  momentum  and  spatial 
coordinates^.  In  the  simpler  language  of  probability,  this  means  that  we  can 
either  get  the  probability  distribution  in  configuration  space  or  in  momentum 
space  but  not  the  joint  distribution  in  both  the  spaces  as  will  be  explained 
presently.  The  statement  regarding  the  amplitudes  tp  and  99  is  deeper  and  more 
fundamental  than  regarding  probability  distributions. 

The  principle  of  complementarity  is  based  on  the  fundamental  postulate 
that  99  and  ip  are  connected  by  the  relation 

yj{x,  t)  =  {2  7i  h)-  ^''^Jcpip,  t)  exp  (-^^^^)  d^  f,  (1) 

p 

or  equivalently  by 

(^(p,0  =  (2^^)''Y^(a?,0exp(-^^^^jd3a:.  (2) 

X 

This  implies  that  if  X^ ,  Xy  and  Xz  are  the  wavelengths  associated  with  the  x,  y  and  z 
directions  for  the  wave  function  with  spatial  dependence  of  the  type  e"*"***  */*, 

i.e.  Xx  =  — ;      Xy  =  —     and     Xz  =  — 

Vx  Vy  Vz 

P)  Pi  Pi 

then  it  is  an  eigenstate  of  the  operators  — ih-—-,  — ih— — ,  — ^^~^  with 
eigenvalues  p^ ,  py  and  p^  respectively. 

y,  z,  t  to  denote  the  components  of  a  four-vector.  This  is  usually  done  when  we  wish  to  sum 
over  the  four  components  x,  y,  z,  t  and  we  then  use  a  dummy  index  %,  the  summation  extend- 
ing from  1  to  4.  However  the  distinction  between  the  two  different  modes  in  the  use  of 
the  indices  will  be  apparent  from  the  context. 

Except  in  the  first  chapter  we  shall  set  ^  =  1,  c  =  1,  i.e.  we  work  in  natural  units.  The 
conversion  table  for  transforming  natural  units  into  usual  units  is  given  in  the  appendix. 

^  There  are  some  who  believe  that  it  is  possible  to  define  a  function  ^{p,  x)  such  that 
^*0  (j3^  (j3^  represents  the  joint  probability  that  the  particle  lies  in  the  spatial  volume  d^x 
and  the  momentum  volume  d^^,  i.e.  tp*ip  and  (p*(p  are  marginal  probability  distributions. 
But  in  such  a  case,  the  relationship  between  0,  y)  and  99  is  obscure  and  it  is  not  clear  whether 
the  principle  of  complementarity  as  envisaged  in  equations  (1)  and  (2)  will  be  consistent 
with  the  existence  of  0(p,  a:). 
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In  classical  theory,  given  the  field  with  which  the  particle  interacts,  all  the 
dynamical  variables  are  determined  by  the  equations  of  motion.  In  particular, 
the  spatial  and  momentum  coordinates  of  the  particles  can  be  determined  uni- 
quely. In  quantum  mechanics,  the  particle  is  described  only  in  terms  of  the  wave 
function  y)  or  (p.  We  can  thus  speak  only  of  the  mean  values  of  the  dynamical 
variables  of  the  particle.  For  instance,  the  mean  value  of  the  x  coordinate  can  be 
obtained  from  y)  as^ 

x^jip*xtpd^x.  (3) 

Since  the  particle  can  be  represented  either  by  ^  or  99  it  is  possible  to  get  the  mean 
value  of  X  from  (p  functions  also  as 

x^{2  7ih)        j'tp*  xd^xj  q){p)ex-p  I — jd^^ 

=  {27thy^'^  lip*  d^xjih [9'(l>)]exp[— — — jd^^ 

=  (2  TT  hy  ^^^  I  i  h  -- — -   99  {p)   d^p  I  y)*  exp  I  —^ —  j  d^x 

=J  (P*  iP)  i  h  J--  [(P  iP)]  d'  ?>.  (4) 

Therefore  to  obtain  the  mean  value  of  the  dynamical  variable  x  we  have  to  insert 

the  operator  i  h  — —  between  (p*  and  99  if  we  are  dealing  with  wave  functions  in 

momentum  space.  The  same  considerations  apply  to  the  coordinates  y  and  z 
and  to  any  dynamical  variable  oc  which  is  a  function  only  of  the  coordinates  x,  y 
and  z.  To  obtain  the  mean  value  of  oc  when  the  wave  function  is  represented  in 
momentum  space  we  have  to  express  oc  in  terms  of  x,  y  and  z  and  replace  them  by 
operators,  insert  oc  between  99*  and  99  and  integrate  over  momentum  space. 
Conversely  if  we  wish  to  find  the  mean  value  of  the  momentum,  it  is  given  by 

p^=J(p*p^(P'd^p.  (5) 

p 

But  if  we  work  in  configuration  space 

p-^=  Lp*l—ifi—^jy)d^x,  (6) 

i.e.  we  have  to  represent  p^  by  the  operator  — ihdjdx.  The  representation  of  p 
by  an  operator  when  working  in  configuration  space  and  x  by  an  operator  when 
working  in  momentum  space  is  equivalent  to  writing  the  commutation  rules 
for  momentum  and  space  coordinates  as  follows : 

[X,  Px\  =  [y,  Py]  =  [Z,  Vz]  =  i  h  (7) 

1  We  adopt  the  convention  that  the  symbols  J  and  27  imply,  except  when  otherwise  stated 
explicity,  integration  and  summation  over  the  entire  permissible  range. 
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All  other  pairs  of  dynamical  variables  taken  from  Pz,  Py,  Pz^  ^>  V  ^^^  ^  commute 
with  each  other.  If  a  dynamical  variable  j5  is  a  function  of  p  only,  its  mean  value 
can  be  obtained  directly  from  the  cp  function.  In  configuration  space,  we  have 
to  represent  p  by  an  operator  in  P{p).  Speaking  quite  generally,  if  two  dynamical 
variables  are  canonically  conjugate  in  the  classical  sense  like  the  space  and  momen- 
tum coordinates,  one  of  them  is  to  be  represented  by  an  operator  when  we  work 
with  the  wave  function  in  the  space  of  the  other  variable. 

The  question  now  arises  as  to  how  we  can  determine  the  mean  values  of  func- 
tions of  both  the  space  and  momentum  coordinates.  Examples  of  such  functions 
which  play  a  dominant  role  are  the  Lagrangian  and  the  Hamiltonian.  It  is 
obvious  that  we  should  express  such  functions  in  terms  of  both  space  and  momen- 
tum coordinates,  and  replace  the  momenta  by  operators  in  configuration  space 
or  replace  the  spatial  coordinates  by  operators  in  momentum  space.  The  coordinate 
canonically  conjugate  to  the  Hamiltonian  H  is  t  and  hence  H  can  also  be 

represented  by  the  operator  ^^~^-  Equivalence  of  this  operator  with  that  of  the 

Hamiltonian  expressed  in  terms  of  space  and  momentum  coordinates,  gives 
the  equation  of  motion 

ih^  =  Hy>  (8) 

known  as  the  Schrodinger  equation  representing  the  temporal  evolution  of  the 
wave  function  ^.  The  wave  function  is  considered  stationary  if  the  only  time 
dependence  of  ^  is  of  the  form  e~*^^/*,  i.e.  ip  can  be  written  as^ 

\p{x,  y,  z,  t)  =  e-^^^l^xp{x,  y,  z)  (9) 

where  ^  is  a  constant  presently  to  be  identified  with  energy.  In  such  a  case, 
substituting  for  ip  in  the  equation  of  motion,  we  get 

Hy)(x)  =  Eyj{x)  (10) 

where  ip  satisfies  the  time  independent  Schrodinger  equation.  This  leads  to  the 
problem  of  finding  the  eigenvalues  E^^  and  the  eigenfuntions  tp^  corresponding 
to  the  Hamiltonian  operator  H.  The  eigenvalue  E^^  of  H  is  the  energy  of  the 
particle  if  the  particle  is  in  a  state  yj^  •  The  eigenstate  of  energy  can  also  be  the 
eigenstate  of  the  momentum  operator  and  such  an  eigenfunction  will  have 
space  and  time  dependence  of  the  type  q-'^v-xIh  ^  Q{-iEt  +  ip-x)ih  corresponding  to 
a  frequency  v  with  hv  =  E. 

If  the  particle  is  in  interaction  with  external  fields,  the  interaction  is  represented 
through  the  inclusion  in  the  Hamiltonian  of  terms  corresponding  to  these  fields 
and  stationary  solutions  may  exist  under  suitable  circumstances  for  particles 
in  such  fields.  In  such  a  case  the  particle  is  supposed  to  be  bound  to  the  external 
field  and  the  corresponding  wave  function  '^p^^  represents  the  bound  state  of  the 

^  For  convenience  we  may  use  the  same  symbol  to  denote  different  functions  if  they 
can  be  distinguished  from  the  context  or  through  the  parameters  in  brackets  and  conse- 
quently no  confusion  is  likely  to  arise. 
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particle  corresponding  to  the  energy  E^-  For  example,  in  the  case  of  the  hydrogen 
atom  the  Coulomb  field  of  the  nucleus  V{r)  can  be  included  in  the  Hamiltonian. 
In  such  a  case  energy  is  a  function  of  both  space  and  momentum  variables  and 
the  eigenstate  of  energy  is  not  an  eigenstate  of  space  and  momentum  operators 
separately.  This  implies  that  the  wave  function  which  is  an  eigenstate  of  energy 
is  not  of  the  exponential  type  in  the  configuration  or  momentum  space.  Defining 
the  quantity 


{tp*Fy)  —  y)Vy)*)  {U) 


2im 
we  find  that  it  satisfies  the  continuity  equation 

-g-  +  divj  =  0  (12) 

where  Q  =  ip*yj.  j  can  therefore  be  identified  with  the  current.  If  we  do  not  include 
any  term  representing  the  interaction  then  the  Hamiltonian  corresponds  to  that 
of  a  free  particle  and  energy  is  connected  with  momentum  by  the  relation 

in  the  relativistic  case  and  by 

E  =  p^l2m 

in  the  nonrelativistic  case.  The  choice  of  the  Hamiltonian  and  the  wave  function 
must  be  such  that 

y){x,  y,  z,,  t)  =  y){x,  y,  z)e~*^p^/*      and     Hy){x,  y,  z)^Ey){x,  y,  z)       (13) 

are  satisfied.  In  the  present  discussion  we  have  considered  ^  to  be  a  complex 
function  and  in  such  a  case  the  Hamiltonian  in  momentum  space  for  a  free  non- 
relativistic particle  is  p^l2m.  On  the  other  hand  if  we  postulate  that  the  wave 
function  is  represented  by  a  set  of  ^'s,  the  choice  of  the  Hamiltonian  depends 
on  the  energy-momentum  relation  and  on  what  properties  of  the  free  particle 
we  want  to  include  in  ip.  For  example,  if  we  wish  to  include  spin  in  the  non- 
relativistic case  by  defining  a  two  component  wave  function,  then  H  will  involve 
2x2  matrices  in  a  manner  as  to  yield  the  energy  of  the  system  when  operating 
on  ^.  If  we  wish  however  to  satisfy  the  demands  of  relativity  and  spin  then  yj 
must  have  four  components  and  H  consequently  will  involve  4X4  matrices 
suitably  chosen  to  give  the  energy  when  operating  on  ip. 

The  equation  of  motion  (8)  need  not  have  stationary  solutions.  In  fact  this 
is  the  starting  point  of  collision  or  scattering  theory  where  the  interaction  is  such 
that  we  do  not  obtain  bound  state  solutions.  In  such  a  case  the  Hamiltonian  is 
split  into  two  parts  Hq  and  H^  and  the  evolution  of  the  system  from  t  =  —  oo 
to  ^  =  +  oo  is  studied  under  the  assumption  that  Hq  is  time  independent  and  W 
is  such  that  it  vanishes  at  ^  =  — oo  and  ^  =  +  oo.  We  are  then  interested  in  transi- 
tions from  eigenstates  of  Hq  at  ^  =  — oo  to  those  at  ^  =  +  oo. 


pis  the  gradient  operator. 
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Having  included  the  exponential  time  dependence  in  a  free  particle  wave 
function,  we  can  define  a  transform  for  y){x,  y,  z,  t)  as 

A{p,  t)  =  {27th)'^'^fw{x,t)e-m(P-^-^pi^d^x  (14) 

where  Ep  is  the  relativistic  energy  of  the  free  particle  with 

El  =  p^c^  +  m^c*.  '  (15) 

We  immediately  recognize  that 

q){p,  t)  =  A{p,  t)e-i^i'K  (16) 

In  terms  of  A  {p,  t)  the  inverse  relation  can  be  written  as 

y>{x,  t)  =  {2  7th)~^'"iMP,  t)e^<^I^XP-^-^vt)d?p.  (17) 

A  particular  case  of  the  wave  function  is  that  of  a  free  particle  with  a  definite 
momentum  and  energy  p^,  Eq.  The  wave  function  in  configuration  space  would 
then  be  represented  by 

^_p(t/A)(p,.a--^,0^  (18) 

Its  momentum  representation  is  given  by 

Let  us  now  calculate  the  mean  and  mean  square  deviations  of  x  and  p  corre- 
sponding to  the  general  wave  function  y).  The  Fourier  relationship  between  (p  and  tp 
forces  upon  us,  as  a  necessary  mathematical  consequence,  a  relation  between 
these  two  deviations  which  is  known  as  the  uncertainty  principle.  We  first  cal- 
culate Pi    *     ^ 

^=.fp^(p*(pd^p  =  h^f-^ ^d^x  (20) 

J  J    ox     ox 

and 

^=fx^w*y)d^x=h^f^*    ^d^p  (21) 

J  J    dp      dp 

If  {x-xf  =  (AxY;         P^={p-pf  =  (Ap)\  (22) 

Working,  without  loss  of  generality,  in  a  system  in  which  ^  =  0,  ^  =  0  so  that 

{Apf  =  p^;      {Axf  =  xK 
Let  us  consider 

2)=       "^    -  •    ^^ 


2p  ^^    dx 
i.e. 


0,  (23) 


D  = WW*  A w  — h w*  — ^   4-  —^ ^— 

4f^2\2  ^^         23^\       ^^  dx      '    dx     dx 


I  (  x\^        .        I     d   /  X  \       11 


^       dy)     dy)* 

dx      dx 

dtp    dip"^ 


i-(^)^-f^'-2-^]w*+j-^(-j¥'v*)+-g  3^ 


(24) 
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"^^"^  fDd^x  =  ^p^ ~^0  (25) 

and 

^2 

p^:^  =  {Apf{Axf^^-  (26) 

More  generally  such  a  relationship  exists  for  all  canonically  conjugate  variables. 
The  other  physically  interesting  relationship  is  between  energy  and  time.  The 
meaning  of  AE,  the  uncertainty  in  energy,  is  obvious  since  energy  is  a  function 
of  momentum  and  space  and  its  fluctuation  can  be  computed  from  the  wave 
function.  However  the  physical  interpretation  of  Zl  /  is  not  so  obvious  since  the 
wave  function  is  defined  at  a  particular  time  and  till  now  we  have  not  adverted 
to  ^  as  a  parameter  with  a  probability  distribution.  For  the  non-relativistic 
particle 

E  =  p^l2m  =  imv^;     AE  =  mAv  (27) 

(where  v  is  the  velocity)  and  hence  from  the  uncertainty  principle 

Ap  •  Ax  ^:>^  h 

Ax 
we  can  write  AE  At  ^=^h  where  A  t  = is  the  time  taken  for  the  particle  to 

travel  a  distance  equal  to  its  spatial  extension  A  x.  This  is  equivalent  to  saying 
that  it  is  the  time  for  the  creation  of  the  particle  provided  we  identify  a  wave 
pulse  of  spread  A  x  with  the  particle  itself.  The  energy  E  of  the  particle  is  then 
to  be  attributed  not  to  any  particular  time  but  only  to  a  time  point  lying  anywhere 
in  the  interval  A  t,  the  error  involved  in  the  energy  being  A  E. 

The  principle  of  superposition^ 

The  wave  functions  ip  and  q?  expressed  in  configuration  and  momentum  space 
respectively  are  representations  of  the  same  state  of  the  system.  It  has  naturally 
been  found  convenient  to  conceive  of  the  state  of  the  system  without  any  reference 
to  the  particular  representation  and  interpret  the  wave  functions  ip  and  (p  for 
example,  as  projections  of  the  state  in  the  configuration  and  momentum  spaces. 
The  dynamical  variables  are  operators,  their  representations  depending  on  the 
space  in  which  we  wish  to  describe  the  states.  Since  the  eigenvalues  of  these 
operators  are  real,  we  require  the  operators  to  be  hermitian.  A  set  of  states  for 
example  can  be  obtained  as  the  eigenstates  of  a  suitably  chosen  operator  a.  If  oc 
is  an  eigenvalue  of  ^,  it  will  be  independent  of  the  representation  of  the  operator^. 
The  eigenstate  can  be  symbolically  represented  by  the  "ket"  vector  |^>^ 

i.e.  ocWy  =  oc  \ocy.  (28) 


^  P.  A.  M.  DiRAC,  The  Principles  of  Quantum  Mechanics,  Oxford  University  Press,  London 
(1947). 

2  P.  T.  Matthews,  Lectures  delivered  at  the  University  of  Rochester  on  "The  Relativistic 
Quantum  Theory  of  Elementary  Particle  Interactions"  (1957). 
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A  complete  system  of  eigenvectors  is  obtained  by  taking  all  the  eigenvectors 
corresponding  to  all  the  eigenvalues.  Defining  the  "  bra"  vector  <(X  |  as  the  hermitian 
conjugate  of  the  ket  vector  |a>,  the  orthonormality  relation  can  then  be  written  as 

<^'|0  =  (5,.,^'  (29) 

where  da'oc"  is  the  Kronecker  delta  function  if  oc'  and  (x"  belong  to  a  discrete 
spectrum  and  is  the  Dirac  delta  function  if  «;'  and  oc"  belong  to  a  continuous 
spectrum.  The  completeness  relationship  is  expressed  by 

/|^'><^'|=1  (30) 

«' 

where  /  is  a  summation  or  an  integration  over  the  variable  oc' . 

The  principle  of  superposition  of  states:  Any  given  state  |y)>  of  a  system  can  be 
expressed  as  a  superposition  of  a  complete  set  of  eigenstates,  i.e.  |^>  can  be 
written  as : 

\wy  =  2oic\ocj,y.  (31) 

k 

I  \py  may  be  the  eigenfunction  of  an  operator  other  than  oc  but  we  are  not  con- 
cerned with  it  here.  C;^.  is  the  amplitude  for  obtaining  the  eigenvalue  (Xj.  in  a 
state  \\py.  /     I    \  /oo\ 

In  quantum  mechanical  problems  we  deal  with  a  |^>  which  can  be  expressed 
as  #  \iy  where  F  is  an  operator  and  |*>  the  eigenstate  corresponding  to  another 
operator  G.li  F  and  G  commute  F  \iy  is  an  eigenstate  of  G  and  hence  the  pro- 
jection oiF\i)  on  any  state  other  than  |i>  is  zero.  However  if  i^  does  not  commute 
with  G  then  the  projection  of  F  \iy  on  a  state  other  than  |^>  say  |/>  is  non-zero, 

'•^*  <f\F\iy  =  Cf^O.  (33) 

This  is  also  called  the  matrix  element  of  i^  between  |i>  and  |/>.  From  the  ortho- 
normality  relations  we  have  c*C]^  to  be  the  probability  of  obtaining  the  eigenvalue 
ajc  in  the  state  ip.  In  particular  the  amplitude  that  a  single  particle  has  momentum 
^  is  <p|^>,  identified  to  be  (pi'p)  and  the  corresponding  amplitude  that  it  is  at  a? 
is  (^x\\py  equal  to  ip(x).  Thus  we  have  the  amplitude  for  a  particle  to  be  at 
X  when  it  has  a  momentum  f  to  be  ix\'py'^.  This  is  equal  to  (27r^)~^/2gtpa-/ft 
From  this  it  follows  that 

ip  {x)  =  <a;  I  ■^>  =jix  \'P)<V\  V>  ^^P 

=  {2  7t  h)-  ^'^fe'^  ■  ^"  <P (P)  ^^P'  (34) 

^  <x|p>  represents  the  wave  function  of  a  particle  of  momentum  p.  This  implies  a  normali- 
zation corresponding  to  y^ — ^  particles  per  unit  volume  or  one  particle  in  a  volume  of 

(2  nY  units.  Later  in  dealing  with  collision  processes,  we  will  be  requiring  a  normalization 
of  one  particle  per  unit  volume  and  hence  <x|^>  will  be  taken  to  be  e^P-^ . 
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Similarly  ^ [p)  =  ip\xpy  =fip \ xy  ix | '^> d^ a; 

The  probabilistic  interpretation  combined  with  the  hermiticity  of  the  operators 
implies  that  by  making  a  unitary  transformation  upon  a  state  all  equations  stay 
invariant,  or  equivalently  theunitary  transform  of  an  eigenstate  is  still  an  eigen- 
state  with  the  same  eigenvalue.  Let  U  be  any  unitary  operator  which  takes 
/\  to  (X,  i.e. 

C7^C7-i  =  a.  (36) 

Conservation  of  probability  requires    UU'^  ==  1.   We  have 

(x\(xy  =  (x\ocy 

UocU-'^U\(xy  =  ocU\ocy.  (37) 

Put 

u\ocy  =  \ocy 

which  gives 

3c\ocy  =  oc\'Siy.  (38) 

When  the  unitary  operator  U  differs  from  unity  by  an  infinitesimal  quantity  we 
can  set 

U  =  l—i£F.  .  (39) 

Thus 

U^  =  l^ieF.  (40) 

The  unitarity  condition  on  U  requires  F  to  be  hermitian.  Under  this  infinitesimal 
unitary  transformation  an  arbitrary  operator  oc  goes  into 

a=UocU-'^  =  {l—isF)(x{l-hisF) 

=  oc  —  i£[F,  oc] 
or 

^  —  oc  =  —ie[F,  oc].  (41) 

The  above  equation  of  motion  is  also  true  in  classical  dynamics  if  the  commutator 
is  interpreted  as  the  Poisson  bracket  and  U  the  generating  function  of  a  contact 
transformation. 

The  principle  of  complementarity  includes  in  a  strict  sense  the  principle  of 
superposition.  The  postulate  that  (p{p)  is  a  Fourier  transform  of  yj{x)  implies 
that  (pip)  is  a  superposition  of  eigenstates  of  momentum  with  amplitude  ip{x). 
It  is  customary  to  work  with  a  complete  set  of  free  particle  states  which  are 
chosen  to  be  the  eigenstates  of  momentum  or  equivalently  energy. 

While  the  square  of  the  modulus  of  the  amplitude  has  a  probabilistic  inter- 
pretation it  is  important  to  note  that  a  knowledge  of  c*C;r.  for  all  k  is  insufficient 
to  describe  the  state  y).  This  is  merely  another  way  of  saying  that  the  principle 
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of  superposition  in  quantum  mechanics  applies  to  wave  functions  and  not  to 
their  moduH.  The  mean  value  of  any  dynamical  quantity  for  the  state  ij)  is 

<y)\A\w>  =  U  c^cl<m\A\ny  (42) 

m,n 
with 

<mM|7i>  =  <^^,|^|^,>  (43) 

i.e.  we  should  know  all  the  amplitudes  c^  or  equivalently,  coefficients  not  merely, 
for  the  case  m  =  n  but  for  all  m  and  n.  The  terms  d^^c^  are  called  the  elements  of 
the  density  matrix  g,  the  trace  of  ^  being  unity.  The  expectation  value  of^  by 
the  above  equation  is 

<y)\A\ipy  =  T,[gA]  (44) 

where  the  elements  of  [A]  are  given  by 

A^,,  =  <:m\A\ny.  (45) 

In  the  quantum  mechanical  description  of  single  particles,  it  is  also  possible 
to  include  attributes  like  spin  which  can  be  imbedded  in  the  free  particle  wave 
functions.  Other  properties  like  charge  can  be  ascribed  as  has  been  stated  at  the 
outset  only  when  we  consider  the  particle  under  interaction  with  external  fields. 
The  concept  of  interaction  in  dealing  with  elementary  particles  in  a  wave  function 
formalism  is  derived  through  the  inclusion  of  suitable  interaction  terms  represent- 
ing classical  fields  in  the  Hamiltonian.  This  results  either  in  solutions  which  are 
eigenstates  of  this  Hamiltonian  representing  bound  states  or  transitions  of 
various  orders  between  eigenstates  of  the  free  particle  Hamiltonian.  In  a  more 
rigorous  sense,  it  is  difficult  to  distinguish  between  properties  imbedded  in  the 
free  particle  wave  function  and  those  described  through  interaction.  For  example, 
mass  which  is  included  in  the  free  particle  wave  function  is  interpreted  as  a 
"self-energy  effect"  due  to  interaction  with  a  field  associated  with  the  particle 
itself.  Similarly  charge  can  be  introduced  in  the  free  particle  wave  function  using 
an  isotopic  spin  formalism  by  increasing  the  components  of  the  wave  function 
in  a  suitable  manner.  In  fact,  new  techniques  have  been  developed  to  describe 
particles  which  are  identical  as  regards  a  certain  class  of  interactions  but  behave 
differently  under  other  interactions.  This  leads  to  the  more  difficult  and  unsolved 
problem  of  the  meaning  of  an  elementary  particle  and  its  properties,  as  will  be 
discussed  in  detail  in  later  chapters. 

If  we  were  to  require  the  annihilation  or  creation  of  a  particle  through  inter- 
action, the  wave  function  formalism  would  be  unsuitable  and  a  field-theoretic 
description  involving  an  operator  calculus  is  forced  upon  us.  In  this  formalism, 
even  the  transition  of  the  same  elementary  particle  from  one  state  to  another  is 
treated  as  an  annihilation  of  the  one  and  the  creation  of  the  other.  We  shall  in 
the  next  few  chapters  be  concerned  only  with  single  particle  wave  functions  and 
the  concept  of  interaction  as  introduced  through  the  perturbation  theory. 

Besides  introducing  attributes  like  spin  and  isotopic  spin  in  the  wave  function 
one  important  mathematical  requirement  will  be  imposed,  viz.  the  equations  of 
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motion  should  be  co variant  under  Lorentz  transformations.  This  is  also  physically 
reasonable  since  the  laws  of  physics  should  be  describable  in  the  same  manner 
in  all  inertial  frames  of  reference.  The  immediate  conseqence  of  the  application 
of  the  principle  of  relativity  is  the  possibility  of  the  existence  of  negative  energy 
states  in  view  of  the  quadratic  relation  between  energy  and  momentum.  Thus 
another  restriction  has  to  be  imposed  on  the  physical  grounds  that  such  negative 
energy  particles  do  not  exist.  This  is  taken  care  of  by  a  suitable  interpretation 
of  these  states.  Particles  which  exist  in  positive  energy  states  continue  to  do  so 
in  the  absence  of  interaction.  If  the  interaction  implies  the  absorption  of  a  negative 
energy  particle  then  this  absorption  is  interpreted  as  the  emission  of  a  positive 
energy  antiparticle,  the  antiparticle  being  defined  by  this  interpretation  itself. 
If  the  particle  and  antiparticle  are  identical  as  in  the  case  of  photons,  we  just 
t^^ke  the  emission  of  a  negative  energy  photon  to  be  the  absorption  of  a  positive 
energy  photon.  In  perturbation  theory  absorption  should  succeed  emission  and 
hence  the  emission  of  a  negative  energy  photon  should  succeed  the  absorption. 
There  is  however  a  very  important  principle  governing  the  behaviour  of  an 
assembly  of  similar  particles  known  as  the  Pauli  principle,  which  states  that  it 
is  impossible  to  find  two  particles  in  the  same  state.  Particles  which  obey  this 
rule  are  called  fermions  while  the  rest  are  called  bosons.  In  a  field  theoretic 
formalism,  the  Pauli  principle  is  introduced  through  commutation  relations 
between  field  operators  associated  with  the  particle.  In  a  wave  function  formalism 
it  cannot  be  introduced  in  the  description  of  a  single  free  particle.  However  the 
statistical  behaviour  in  an  assembly  of  similar  particles  which  can  be  described 
in  the  wave  function  formalism  when  the  total  number  of  particles  is  not  changing, 
depends  upon  whether  the  particles  are  bosons  or  fermions.  In  the  case  of  fermions, 
the  requirement  that  no  two  particles  should  occupy  the  same  state  can  be  satis- 
fied by  choosing  the  wave  function  to  be  a  Slater  determinant 


w 


yk\ 


41^  v^f  . . .  wf 


Wk'  wf  ' ' •  ¥k 


(46) 


where  \p^^^  represents  the  i  th  particle  in  an  assembly  of  k  particles,  in  the  ;  th  state. 
This  implies  that  the  wave  function  should  be  antisymmetric  under  an  exchange 
of  two  particles.  One  of  the  most  important  applications  of  the  Pauli  principle 
is  the  suitable  interpretation  of  negative  energy  states  of  fermions  through  the 
postulate  of  an  unobservable  "sea"  of  negative  energy  particles  and  the  consequent 
introduction  of  the  concept  of  the  antiparticle. 

When  dealing  with  interactions  of  elementary  particles,  a  single  particle 
formalism  with  the  inclusion  of  negative  energy  states  therefore  leads  to  the 
creation  and  annihilation  of  particles  and  antiparticles  and  necessitates  the  use 
of  field-theoretic  methods.  But  the  single  particle  formalism  can  still  be  retained 
by  the  Feynman  interpretation  of  negative  energy  states  by  a  suitable  prescription 
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for  tracing  the  sequence  of  events  when  considering  the  transitions  of  various 
orders  in  an  interaction. 

Thus  the  theory  of  elementary  particles  is  presented  in  the  following  order. 
We  first  give  a  systematic  account  of  free  particle  wave  functions  of  various 
elementary  particles  with  the  corresponding  attributes  imbedded  suitably  in  the 
wave  function  subject  to  the  demands  of  relativity.  Interaction  is  then  described 
through  perturbation  theory  for  a  non-relativistic  system  and  then  by  introducing 
the  concept  of  negative  energy  the  Feynman  formalism  is  dealt  with  in  great 
detail.  This  leads  naturally  to  the  field-theoretic  description  of  interactions. 

2.  SINGLE  PARTICLE  WAVE  EQUATIONS 
Spin  and  Pauli's  equation 

We  have  till  now  considered  the  wave  function  of  a  particle  without  introducing 
the  concepts  of  spin  and  the  requirements  of  relativity.  We  shall  first  deal  with 
the  concept  of  spin,  one  of  the  intrinsic  attributes  of  a  fundamental  particle. 

Before  we  introduce  the  concept  of  spin,  we  shall  discuss  the  nature  of  the 
dynamical  variable  representing  angular  momentum^.  In  classical  mechanics, 
the  components  of  angular  momentum  associated  with  a  particle  at  the  point 
X,  y,  z,  are  given  by 

^x  =  yPz  —  ^Py',       My  =  ^Px  —  ^Vz\       ^z  =  ^Py  —  yPx'  (1) 

In  wave  mechanics  where  the  concept  of  a  point  particle  is  replaced  by  that 

of  a  wave  function,  the  corresponding  angular  momentum  operators  are  con- 

d  d  d 

structed  by  replacing  p^,  Py  and  p^  by  the  operators  —  ^-r— ,  — i— —  and  — *~^~~ 

respectively  when  the  wave  function  is  expressed  in  configuration  space.  Using  the 
well-known  commutation  relations  for  the  components  of  position  and  linear 
momentum  of  a  particle  we  can  readily  derive  the  commutation  rules  for  the 
components  of  the  angular  momentum  operator  which  read 

[M„My]^iM,;     [My,M,]  =  iM,;     [M,,  M,]==i M^,  (2) 

The  square  of  the  total  angular  momentum  is 

M^  =  Ml  +  Ml  +  MI.  (3) 

As  can  be  seen  easily,  this  operator  commutes  with  M^,  My  and  M^. 
Now  if  we  define  the  non-hermitian  operators  ilf  +  and  M_  as 

M^  =  M^-\-iMy;     M_  =  M^  —  iMy  (4) 

they  obey  the  commutation  relations 

[M\M^]  =  0     [M„M,]  =  M, 

[M„  M_]  =  ~M_;     [M^ ,  M_]  =  2M,.  (5) 

1  G.  Wentzel,  Lectures  dehvered  at  the  Tata  Institute  of  Fundamental  Research,  Bombay, 
on  "Special  Topics  in  Quantum  Mechanics"  (1956). 
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Since  M^  an  dJf^  commute,  we  can  choose  a  wave  function  which  is  an  eigen- 
function  of  both  M^  and  M^  simultaneously  and  let  us  represent  the  eigenvalue 
of  these  operators  by  J/|  and  m.  As  we  are  dealing  with  the  simultaneous  eigen- 
function  of  J^P  and  M^ ,  we  can  characterize  such  an  eigenstate  by  two  quantum 
numbers  j  and  m,  j  being  related  to  Mf.  Just  from  the  use  of  the  commutation 
relations,  we  can  obtain  the  values  of  Mf  and  the  possible  values  of  m.  We  shall 
show  that  Mf  =  j[j  +  1)  and  that  m  can  assume  (2;'  +  1)  values  from  — j  to  -\-j, 
j  being  positive.  Taking  the  matrix  element  of  ilf'^  and  M^  between  two  states 
jm  and  j'm'  we  have 

irm'\M^\jmy  =  Mfd^yd^^>  (6) 

<f  m'  \M^\  jmy  =  mdjj'  d^^'.  (7) 

From  the  commutation  relation 

we  obtain  the  condition 

<:fm'\M_^\jmy{Mf>  —  Mff=:0  (8) 

which  implies  that  the  matrix  element  of  M+  exists  only  for  ;  =  f. 
The  commutation  relation 

[M„M,]^M^ 
leads  to 

(m'  —  m)  <m'  |  M^  \  m>  =  <m'  |  if  ^  |  m>  , 
i.e. 

[m'  -m-  1)  <m'  I  Jf+ 1  m>  =  0  (9) 

which  means  that  the  operator  M+  switches  the  state  m  to  m  +  1,  and  similarly 
it  can  be  shown  that  the  operator  M_  switches  the  state  m  to  m  —  1 .  It  is  to  be 
noted  that  the  various  values  of  m  differ  from  the  previous  one  by  unity. 
Finally  it  can  be  shown  that  the  commutation  relation 

[M^,M_]-2M,=  0, 

leads  to  a  difTerence  equation  from  which  the  entire  spectrum  of  eigenvalues  of 
M^  and  M^  can  be  abstracted.  From  the  above  equation  we  have 

2<jm\M^  \jm"yijm"\M_  \jm'y  —  ijm\M_  \jm"yijm"\M^  \jm'y  =  2mdmm'-i\^) 

m" 

Noting  that  <m  \M+  \m"y  is  the  complex  conjugate  of  <m"  \M^  |m>  and  writing 
for  convenience  </m  lilf^  \jm  —  1>  =  Nj^,  we  have 

i-»f,-»r-iAWii'=2w.  (11) 

The  solution  of  the  above  difference  equation  is  given  by 

\Nim\^  =  Cj-m{m-\)  (12) 

Now  two  numbers  m^  and  m^  with  m^  >m2  can  be  found  such  that 

\Ni„,,\'  =  0  (13) 

for     m^rrii     or     m<m2 
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i.e.  the  permissible  values  of  m  range  from  Wg  to  m^  and  no  eigenstates  exist 
with  m  >  m^  or  m  <  m^.  The  above  equation  is  equivalent  to  writing 

and 


l^,-m.i^  =  0 


(i.e.)  mj  +  1  and  m^  are  the  roots  of  the  equation 


or 


Cj  +  m{m  —  1)  =  (m  —  mg)  (m 


m. 


1)- 


Equating  coefficients  of  same  powers  of  m  on  both  sides,  we  obtain 
mg  =  — m^;  Cj  =  m-^  (m^  +  1) 


and 

Writing  m^  =  j  we  have 


m^  (mj  +  1)  —  m  (m  —  1) . 
^  =  j{j  -\-  1)  —  m{m  —  1). 


(14) 

(15) 
(16) 


(17) 
(18) 


Now  m  can  take  the  values 

^-j,-j-^l,..,j-l,j  (19) 

i.e.  2j  must  be  an  integer  or  j  may  be  an  integer  or  half  integer. 

Experimental  evidence  necessitated  the  assumption  of  an  intrinsic  angular  momen- 
tum to  some  fundamental  particles,  i.e.  an  angular  momentum  which  is  an  inseparable 
attribute  like  the  mass  or  charge.  The  square  of  this  angular  momentum  for  an 
electron  is  known  to  be  f  which  should  be  identified  with  j  ^  \  since  we  know 
that  the  total  angular  momentum  operator  M"^  has  the  eigenvalue  j{j  +  1). 
Hence  m  can  take  the  values  di  J .  The  operators  M^  and  My  connect  m  =  | 
and  m  =  —  \  state  and  vice  versa  while  M^  connects  m  =  ±  J  with  m  =  ±  J 
respectively. 

Since  "spin"  is  attributed  to  a  particle  and  cannot  be  traced  to  its  spatial 

a 

distribution  it  is  clear  that  we  cannot  use  differential  operators  like  — i— —  as 
we  did  in  the  case  of  angular  momentum. 

Assuming  that  the  commutation  relationships  define  completely  the  nature  of 
the  angular  momentum  operators  and  that  these  operators  must  connect  states  of 
eigenvalues  ±  J  it  is  possible  to  convince  ourselves  that  the  above  requirements 
can  be  satisfied  if  the  spin  states  are  represented  in  the  two  component  form  as 


and 


0 

T2J 


the  eigenvectors  corresponding  to  the  eigenvalues  ±  \  respectively 


and  the  angular  momentum  operators  are  represented  by  the  matrices/ 


G^  — 


0  1 

0-* 

1   0 

1  0. 

.;    s  = 

i  0  . 

;    Oz  = 

0-1 

(20) 


1  W.  Pauli,  Handbuch  der  Physik,  Vol.  5,  Springer- Verlag  (1958). 
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It  can  be  easily  verified  that  these  matrices  satisfy  the  same  commutation 
relations  as  the  components  of  angular  momentum  and  also  that  these  matrices 
connect  the  states  referred  to  as  before.  Any  spin  state  of  the  system  is  a  linear 

0 


combination  of 
spin  J  should  be 


and 


Thus  the  Schrodinger  equation  for  a  particle  of  ' 


i^  =  Hy>  (21) 


where  H  will  involve  the  2X2  spin  matrices  in  a  manner  that  the  angular  momen- 
tum operator  commutes  with  it.  Pauli  showed  that  such  a  Hamiltonian  can  be 
obtained  by  replacing  jt^  in  the  Schrodinger  equation  by  the  matrix  {O  •  Jt)2. 
Pauli 's  equation  reads : 

Wy)  =  ^{(J'Jtfyj+Vyj  (22) 

2m 

where  Jr  =  {p  —  eA)  and  V  =  ecp  and  W  is  the  kinetic  energy  where  A  and  cp  are 
the  vector  and  scalar  potentials  of  the  electromagnetic  field  in  which  the  particle 
is  moving.  Using  the  general  formula  that  if  B  and  C  are  any  3 -dimensional 

vectors 

{6'B){a'C)  =  {B'C)-\-i{0'BxC)  (23) 

(in  our  case  B  =  C  =  p  —  eA)  and  remembering  that  curl  A  =  H  (where  H 
is  the  magnetic  field),  we  can  write  Pauli's  equation  in  the  form 

Wy)  =  -^[7i^-}-e(J-H]y)+Vy).  (24) 

The  Hamiltonian  given  by  (24)  is  similar  to  the  classical  Hamiltonian  for  an 

electron  in  a  field  except  for  the  term  — — O  •  H,  which  arises  only  in  quantum 
theory. 

This  additional  energy  can  be  attributed  to  an  intrinsic  magnetic  moment  of 

the  electron  given  by  — — <J ,  since  an  electron  which  has  a  spin  angular  momentum 

I  in  a  particular  direction  will  have  a  magnetic  moment  in  the  same  direction. 

el 
The  magnetic  moment  associated  with  orbital  angular  momentum  I  is  — —  or 

I  Bohr  magnetons.  The  ratio  of  magnetic  moment  in  units  of  Bohr  magnetons 
to  the  angular  momentum  is  called  the  gjnromagnetic  ratio  and  in  the  case  of 
spin  it  is  twice  that  associated  with  orbital  angular  momentum,  i.e.  the  magnetic 

moment  is  not  given  by  -x —  but  by  -^ — gs.  For  the  electron  g,  the  gyromagnetic 

ratio  is  approximately  equal  to  2  since  the  magnetic  moment  is  found  to  be 
approximately  one  Bohr  magneton.  For  a  free  particle  (24)  reduces,  if  we  retain 
the  first  form  of  the  equation,  to 

Wip  =  ^{a-pf'ip.  (25) 

EPOR      2 
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Expanding  the  right  hand  side,  we  note  that  the  commutation  relations  for 
G^,Gy, Oz  are  just  the  conditions  for  the  above  equation  to  reduce  to  the  form  of 
Schrodinger  equation  remembering  of  course  that  ip  has  two  components.  In  the 
free  particle  case  each  of  these  components  satisfies  the  Schrodinger  equation  while 
equation  (25)  is  satisfied  only  by  the  two  component  xp.  It  can  be  verified  that 
only  the  square  of  the  sum  of  the  orbital  and  spin  angular  momenta  commutes 
with  the  Hamiltonian  and  not  of  each  of  them  separately.  In  this  way,  the 
concept  of  spin  is  deduced  from  the  equation  postulated  first. 

It  would  be  worth  emphasizing  the  peculiar  features  of  angular  momentum  in 
quantum  mechanics  as  contrasted  with  angular  momentum  in  classical  mechanics. 
In  the  latter  case  the  angular  momentum  of  a  particle  is  represented  by  a  vector  M. 
But  such  a  vector  representation  for  the  total  angular  momentum  is  not  possible 
in  quantum  mechanics  for  the  following  reason.  If  we  are  dealing  with  a  particle 
in  a  definite  state  of  total  angular  momentum,  i.e.  represented  by  an  eigenstate 
of  Jf  ^,  it  is  possible  to  choose  a  state  which  is  an  eigenstate  of  both  the  operator 
corresponding  to  the  total  angular  momentum  M ^  and  the  operator  corresponding 
to  its  component  in  any  specified  direction  (which,  without  loss  of  generality,  can 
be  chosen  to  be  the  ^-direction),  but  this  function  is  not  an  eigenstate  oiM^  or  My 
but  only  of  3/|  +  My.  This  implies  that  we  cannot  represent  the  total  angular 
momentum  by  a  vector  as  we  do  in  classical  mechanics,  since  if  we  did  so,  the  three 
components  would  be  determined  and  this  contradicts  the  above  hypothesis. 

Later  in  dealing  with  general  principles  of  invariance  of  equations  of  motion 
of  fundamental  particles  we  shall  show  how  the  concept  of  angular  momentum 
and  spin  arise  from  the  transformation  properties  under  space  rotation. 

Relatiyistic  requirements  and  the  Klein-Gordon  equation 

We  shall  now  turn  to  the  demands  of  relativity  which  can  be  stated  in  the 
simplest  terms  as  follows.  According  to  the  principle  of  relativity,  the  laws  of  the 
fundamental  particles  should  be  the  same  in  all  systems  of  reference  moving  with 
uniform  velocity  with  respect  to  each  other.  Thus  the  equations  which  express 
these  laws  should  be  co variant  under  Lorentz  transformations.  The  Schrodinger 
equation  of  a  free  particle  which  we  obtained  as  the  quantum  mechanical  analogue 
of  the  classical  non-relativistic  equation  connecting  energy  and  momentum 
is  obviously  not  invariant  under  the  Lorentz  transformation  since  it  is  of  second 
order  in  spatial  coordinates  and  of  first  order  in  the  time  coordinate. 

To  obtain  a  relativistically  invariant  equation,  we  start  from  the  classical 
relativistic  relation  between  the  energy  and  momentum  of  a  free  particle 

^2^;p2_pm2     or     ^2^m2.  (26) 

In  quantum  mechanics  the  corresponding  equation  for  the  wave  function  xp  in 
configuration  space  is  given  by 

[Vl-m^-\xp^0  (27) 

where  ^4  =  ^— —     and     Pi=^ — iVi- 

dt 
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This  equation  known  as  the  Klein-Gordon  equation  was  first  discovered  by 
Schrodinger.  It  satisfies  the  demands  of  relativity  but  does  not  include  the  concept 
of  spin.  In  the  presence  of  an  electromagnetic  field  described  by  the  4- vector 
potential  Af^  the  4- vector  p^^  is  replaced  hj  jz^^  =  p^,  —  e^^^.  Defining  the  'current' J 
as 

{ip*  Vip  —  y)Vy)*}  (28) 


2im 

the  charge  density  q  which  should  satisfy  the  continuity  equation  along  with  j 
is  given  by 

It  is  to  be  noted  that  while  in  the  case  of  the  non-relativistic  Schrodinger  equation  g, 

defined  as  ^*^,  satisfied  the  continuity  equation,  in  this  case  it  is  only  the  g 

defined  as  above  that  satisfies  such  an  equation.  Since  the  Klein-Gordon  equation  is 

of  second  order  in  the  time  derivative,  its  solution  can  be  obtained  only  by  specifying 

dw 
two  initial  conditions,  i.e.  w  and  -;—-  at  some  time  t.  As  these  can  be  chosen 

^  dt 

independently,  q  may  assume  positive  as  well  as  negative  values.  This  defect  was 

realised  even  when  the  equation  was  originally  proposed  by  Schrodinger.  Later 

Pauli  and  Weisskopf^  were  able  to  interpret  the  Klein-Gordon  equation  as 

a  field  equation  and  q  as  the  difference  between  the  densities  of  two  types  of 

particles  with  opposite  charges.  This  interpretation  can  be  understood  in  the 

wave  function  formalism  by  rewriting  the  Klein-Gordon  equation  in  the  following 

manner^.  It  is  well  known  that  a  second  order  equation  in  ip  is  equivalent  to 

two  simultaneous  first  order  equations  in  two  independent  variables  ip  and  -^— 

or  more  generally  in  two  independent  linear  combinations  of  tp  and  — — .  In  parti- 
cular defining  0  and  x  by 

we  see  that  0  and  x  satisfy 
.30         I 


dt        2m 


dt        2m 


I 


eA 


\^  +  X)-^{ecp^m)0  (31) 


{^  +  X)  +  {e(p-m)z.  (32) 


The  above  set  of  equations  can  be  written  in  the  matrix  form 


dt 


H  W  (33) 


1  W.  Pauli  and  V.  Weisskopf,  Helv.  Phys.  Acta  7,  709  (1934). 

2  H.  Feshbach  and  F.  Villars,  Rev.  Mod.  Phys.  30,  24(1958). 
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1 


(34) 


and  Gj^,Gy,Gz  are  the  Pauli  spin  matrices  given  by  equation  (20).  W  is  the  two 
component  wave  function  which  satisfies  the  Klein-Gordon  equation  in  the  first 
order  form.  In  terms  of  this  two  component  wave  function  ^  we  define  q  as 

^  =  y^*a,¥^=0*0  — ;^*;j:.  (35) 

Note  that  we  have  introduced  the  matrix  g^  in  the  expression  for  q  and  that  q 
is  the  difference  of  two  positive- definite  quantities.  Defining  another  two  compo- 


nent wave  function  ^^ 


0i 


we  find  that  W^.  satisfies 


dW, 


dt 


^-  =  H(-e)W, 


(36) 


where  H{ — e)  is  the  Hamiltonian  (34)  with  e  replaced  by  —  e.  We  therefore  re- 
cognize Wf.  to  be  the  charge  conjugate  wave  function  oiW.  0  and  ;^  are  determined 
if  we  are  given  W  and  Wc  •  The  density  q  can  be  defined  in  terms  of  W  and  W,,  as : 

Q  =  \[W*<,,W-W:a,'P,-\  =  0*0-X*X,  (37) 

i.e.  as  the  difference  of  two  densities  corresponding  to  particles  of  opposite  charges. 
For  a  given  momentum  p  we  can  write  the  free  particle  solution  of  the  Klein- 
Gordon  equation  as 

I  iij.  I  nrtv 


l  XoiP) 


(39) 


where  p^  is  the  eigenvalue  of  H.  Feeding  this  solution  in  equation  (33)  we  have 
(2>4 -  m)  00  =  b72m)  (00  +  Zo) 
iPi-^  m)  Xo  =  {-P^l2m)  {0^  +  Xo)  • 

The  characteristic    equation    for  p^  is  pi  =  {p'^  +  m^)  =  E^.  Thus  for  a  given 
value  of  p  there  are  two  solutions  corresponding  to  the  two  values  ±  E^  of  p^ 


where  E^ 


+  yp^  +  m^.  The  associated  solutions  are : 

Ej,  +  m 


p,  =  -\-E^:     Wl{p)  = 


Xq  . 


(40) 
(41) 
(42) 

Q  =  W-^W-  =  -\.  (43) 

We  recognise  that  W-(p)  =  Wc{p),  i.e.  the  negative  energy  solution  !F-  is  just 
the  charge  conjugate  solution  ^c  of  the  positive  energy. 


V^ 


E. 


0- 

Xo 


2ymEj 
m  —  Ej, 
2]/mEp_ 

m-Ep 
2-\/mEp 
Ep  +  m 
2]/mEp 
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The  Dirac  equation  ^ 

We  have  seen  that  when  the  non-relativistic  Schrodinger  equation  is  generalized 
to  meet  the  demands  of  relativity,  it  becomes  an  equation  of  the  second  order 
in  the  differential  coefficient  with  respect  to  time.  This  brings  in  its  wake  the 
difficulty  of  the  probability  density  becoming  negative,  a  characteristic  feature 
of  the  second  order  (in  time)  equations.  Such  an  equation  can  be  reduced  to  a  first 
order  equation  with  two  component  wave  functions.  On  the  other  hand,  if  we 
ignore  relativity  but  take  into  account  the  two  possible  spin  states  of  a  spin  half 
particle,  we  are  led  to  a  first  order  (in  time)  two  component  equation. 

To  satisfy  the  demands  of  both  relativity  and  spin  it  is  necessary  either  to 
generalize  the  Pauli  equation  to  satisfy  the  relativistic  requirements  or  the  two 
component  first  order  form  of  the  Klein-Gordon  equation  to  include  spin.  The 
first  procedure  is  easier  and  is  outlined  below.  This  will  be  followed  by  a  formal 
derivation  of  the  Dirac  equation  from  "first  principles". 

We  shall  adopt  a  derivation  due  to  Feynman^  which  consists  in  writing 
a  second  order  relativistic  equation  including  spin  and  expressing  it  in  the  first 
order  form  which  is  the  Dirac  equation.  The  Pauli  equation  was  obtained  by 
replacing  the  term  n^  in  the  Schrodinger  Hamiltonian  by  (O  •  jr)2.  Thus  including 
spin  in  the  second  order  Klein-Gordon  equation  we  expect  to  have 

{7il  —  {0':tf}'(p  =  m'y)  (44) 

where 

^/*  =  :??;,  —  e  ^/. . 

It  is  to  be  noted  that  ip  now  has  two  components.  The  second  order  form  of  the 
Dirac  equation  which  can  be  reduced  to  the  familiar  first  order  form  with  four 
components  should  correctly  be  written  as 

{ti^-\-6 'Jt){7i^  —  a '3t)\p  =  m^\p.  (45)3 


Defining  ;^  by 
we  have 


{7Z^-^a'Ci)\p  =  mx  (46) 

(7r4  —  <J '  ct)  x  =  'inyj.  (47) 

Writing  y^a  =="  W  '^  X  ^^^  Wh  ^  X  —  W  ^^^  above  pair  of  equations  reduces  to 

{y^n^~y':x)ip  =  mip  (48) 

where  y^^i/bi  =  1,  2,  3,  4)  are  the  4X4  Dirac  matrices  defined  as  follows: 


Vi 


I    0 
0-/ 


7  = 


0    a 

-6  0 


and     ^  = 


(49) 


This  is  the  well  known  form  of  the  Dirac  equation. 


^  P.  A.  M.  DiRAC^  Principles  of  Quantum  Mechanics,  Oxford  University  Press,  London 
(1947). 

2  R.  P.  Feynman,  Lectures  delivered  at  the  California  Institute  of  Technology,  on  "Quan- 
tum Electrodynamics"  (1953). 

^  (45)  is  distinct  from  (44)  since  n^  does  not  commute  with  a»7C. 
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If  we  adopt  a  similar  procedure  to  include  spin  in  the  two  component  Klein- 
Gordon  equation,  we  encounter  the  square  root  of  the  operator  n^  in  place  of  n 
so  that  we  cannot  form  a  scalar  product  like  d  •  Jt.  Thus  it  seems  that  the  deri- 
vation of  the  Dirac  equation  is  not  so  obvious  in  this  approach. 

For  the  sake  of  completeness  we  shall  now  give  the  formal  derivation  of  the 
Dirac  equation^.  The  most  general  linear  equation  of  the  first  order  in  space 
and  time  derivatives  satisfied  by  an  N  component  wave  function  (iV  x  1  matrix) 
ip  can  be  written  as 

^"^    "rSo^k—^-^-irriPxp^O  (50) 


where  ^i ,  cvg ,  ^3  and  /S  are  N  x  N  matrices  whose  elements  are  independent  of 
space  time  coordinates.  We  shall  now  obtain  the  conditions  satisfied  by  these 
matrices  merely  by  imposing  the  quantum  mechanical  requirements  on  ip.  We 
define  the  probability  density  q  as  ^*^  where  yj*  the  hermitian  adjoint  of  ^,  is  a 
row  matrix.  In  order  that  q  is  always  positive-definite  and  satisfies  the  equation 
of  continuity,  we  must  have  oc's  and  ^  hermitian,  i.e. 

(«;,)t=«,;      /?t  =  ^.  (51) 

If  we  now  require  that  the  second  order  form  of  the  Dirac  equation  should 
correspond  to  the  Klein-Gordon  equation,  we  obtain  the  following  properties 

of  the  ^'s  and  p. 

(i)  (X,-oci-^ociOCi  =  2dii 
where  Sii  =  0,     i^l 

da  =  \,     i  =  l  (52) 

(ii)  oc,,^-{-^(Xk  =  0     and 
(iii)  {oc,r  =  {^f  =  l. 

Using  these  relations  we  shall  show  that  N  must  be  even.  We  have  from  (ii) 

«i/S  =  -;8a,.  (53) 

Taking  the  trace  on  both  sides  we  have : 

=  -Tr[«,{«,)-'fi]=-Tr^  =  0.  (54) 

If  we  choose  the  ^  matrix  to  be  diagonal,  since  ^^  =  I  its  diagonal  elements 
are  +1  or  — 1.  As  Tr{P)  =  0,  P  must  have  an  even  number  of  diagonal  elements, 
i.e.  P  is  even.  The  a/s  must  also  be  evenasthey  are  of  the  same  dimensions  as  jS. 
It  can  be  shown  that  there  are  only  sixteen  linearly  independent  matrices  obeying 


^  S.  ScHWEBER,  H.  A.  Bethe  and  F.  de  Hoffmann,  Mesons  a7id  Fields,  Vol.  I,  Row, 
Peterson,  Evanston,  Illinois  (1955). 
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the  above  relations.  Consequently  it  is  enough  if  iV  =  4  as  we  can  choose  sixteen 
linearly  independent  4x4  matrices. 

To  cast  the  Dirac  equation  in  a  form  more  symmetric  in  space  and  time  we 

write 

^  =  7,;     pocj,  =  yj,{k=l,2,3)     or     pa  =  y,  (55) 

On  this  definition,  the  yjg  are  anti-hermitian,  i.e.  (yj.)^  =  — yj^  with  (yj^)^  ==  — 1 
while  y^  is  hermitian  with  yj  =  +  1  •  As  before  we  have 

(y^y.+r.y^) =2,5^„.  (56)' 

These  commutation  rules  suffice  to  define  the  Dirac  equation.  However  it  will 
be  convenient  to  represent  the  y-matrices  explicitly  the  standard  representation 
of  which  has  been  given  earlier.  The  Dirac  equation  now  reads : 


|— ^(74-g^  +  r-  F)+mU  =  0. 


(57) 


In  accordance  with  Feynman's  notation  if  A  be  any  4- vector  we  define  A  as: 

A  =  y^A^-y'A==y^A^.  (58) 

Defining  the  4-dimensional   F  as 

•^•"•^^   dx  '    dy  '    dz   '       '"  dx,  ^**''' 

the  Dirac  equation  reads 

{iV—m)y)  =  0  (60) 

id 
Replacing  — —  by  p^^  the  equation  becomes 

{p  —  m)y)  =  0,     ?  =  y^v^-  (61) 

Defining  y)  the    adjoint  wave  function  by  ^  =  xp^y^,  we  find  that  ip  satisfies 
^(p  —  m)  =  0.  The  current-density  4- vector  is  now: 

{Q,j)  =  j^  =  ^y^w  (62) 

Before  discussing  the  solutions  of  the  Dirac  equation  let  us  advert  to  one 
important  transformation  property  of  the  equation  under  a  Lorentz  transformation. 
If  we  define  the  Lorentz  transformation  by  x'  =  Lx  then  the  wave  function 
in  the  new  frame  is  not  obtained  by  merely  replacing  x  by  L"  V  but  also  by 
operating  a  matrix  S  on  0{L~^x'),  i.e.  0'(x^)  =  S0{L-^x').  0'  satisfies  the  same 
equation  as  0  the  differential  coefficients  now  being  with  respect  to  x'  instead 
of  X  and  this  matrix  S  is  defined  by 

V 

where  L,^^  are  the  elements  of  the  matrix  L.  We  shall  continue  this  discussion 
in  the  chapter  III,  section  4. 


1  Here(5y«y  =  0  /t4=v,(5ii  =  <522  =  533  =  — ],<544  =  1.  This  should  be  distinguished  from  5,-j in  (52). 
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Solutions  of  the  Dirac  equation  for  a  free  particle 

From  now  on  we  will  confine  ourselves  to  the  solutions  of  the  Dirac  equation 

for  a  free  particle                             ,            ^         ^  ,«*»v 

^                                        {p  —  m)y)=^0.  (63) 

In  analogy  with  the  free  particle  solution  of  the  Schrodinger  equation  let  us 
assume  the  solution  for  a  given  momentum  p  to  be  of  the  form 

^p  =  u{p)e-^P■^,  (64) 


I.e. 


>r 

~< 

¥^2 
W3 

= 

^3 

\jpj 

L%j 

(65) 


where  the  coefficient  p^  of  t  and  the  eigen  vector  u  [p)  are  to  be  determined  from 
the  condition  that  y)  has  to  satisfy  the  Dirac  equation.  The  equation  for  the  Dirac 
spinor  u  is  obtained  by  feeding  the  solution  in  equation  (63) : 

pip  =  mip;pu=^mu.  (66) 

We  can  rewrite  the  Dirac  equation  in  the  form 

Hu  =  p^u.  (67) 

where  tt      ,  ,   o     ^ 

p^  is  the  eigenvalue  of  («  .  p  +  /5  m)  and  the  characteristic  equation  is  pl  =  p^  +  m^. 
Thus  ^4  can  be  identified  with  the  energy  which  can  take  positive  or  negative 
values  i  E  for  a  given  value  of  p.  To  obtain  the  eigenvector  corresponding  to 
an  eigenvalue  we  must  solve  the  following  set  of  homogeneous  equations 

{E  —  m)u^  —  p,u^—{p^  —  ipy)u^  =  0 

''-  {E  —  m)uz  —  {p^  +  ipy)u^-]rPzU^  =  0 

Pzf^l+{Px  —  iPy)'^2—iE  +  '>^)^3  =  0 
{Px  +  '^  Py)  f^l  —  Pzf^2—  (E  +  f^)  ^(^4  =  ^  ' 

We  can  choose  any  two  of  the  four  components  u^,  u^,  u^  and  ^t4  arbitrarily, 
the  other  two  being  determined  from  two  of  the  four  equations.  The  condition 
that  the  solutions  from  the  two  sets  are  the  same  is  just  that  E^  =  p^  +  w?-. 
There  are  two  forms  in  which  the  two  independent  solutions  can  be  obtained. 

Fwrn  A. 

Taking  E  =  ^jp^  +  m^  we  arrive  at  two  independent  solutions  ^^(1)  and  w(2) 
by  setting  u^^  =  E  +  m,  u^  =  0  and  u^^  0,u<^  =  E  +  m 


(69) 


1 

u{\)A. 

~  E-\-m   ~ 
0 

Pz 
.Pz-^^Py. 

1 

u{2)A 
~        0 

E  +  m 

]lE  +  m 

1/^  +  m 

Px  —  iPy 
_        -Pz 
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The  factor 


is  chosen  on  the  basis  of  a  normalization  corresponding  to 


2E  electrons  per  unit  volume. 

Form  B. 

Alternatively  we  could  have  chosen  u.^=  [E  —  m),u^=  0  and  u^=  0,u^=  E  —  m 
in  which  case  the  solutions  read : 

u{l)B  u{2)B 


]/E 


Pz 

h+'^Py 

E  —  m 

0 

1 


]/E  —  m 


h- 

-iPy' 

- 

-Pz 

0 

E 

—  m 

(70) 


The  above  solutions^  are  valid  even  for  negative  energy  and  in  such  a  case  it  is 
convenient  to  write  ^  as  —  |^  |.  But  it  is  to  be  remembered  that  the  normalization 
corresponds  to  2^  electrons  per  unit  volume  and  this  is  negative  when  the 
energy  is  negative. 

However  if  we  normalize  the  solutions  such  that  there  are  2\E\  electrons  per 
unit  volume,  the  positive  energy  solutions  are  the  same  while  the  negative  energy 
solutions  have  to  be  written  as 


v(l)A 

v{2)A 

~—E  +  m~ 

0 

1 

0 

Pz 

1 

—E  +  m 

]/E  —  m 

]/E  —  m 

Px  —  iPy 

.Px+iPy. 

.       -Pz      . 

v{\)B 

v{2)B 

Pz 

~Px  —  iPy 

1 

Px  +  i  Py 

—E  —  m 

1 

—  Pz 

]/^  +  m 

]/E  +  m 

0 

0 

—E  —  m 

(71) 


It  is  convenient  to  choose  u(l)A  and  u{2)A  as  the  positive  energy  solutions  and 
v{l)B  and  v{2)B  as  the  negative  energy  solutions  as  in  the  rest  system  u{l)B^ 
u{2)B,  v(l)A  and  v{2)A  vanish. 

Thus  any  solution  for  a  positive  energy  ^  is  a  linear  combination  of  u{l)A 
and  u(2)A  while  that  for  a  negative  energy  —\E\  is  a  combination  o{v{l)B  and 
v{2)B.  To  determine  the  solution  in  any  particular  case  it  is  obvious  that  the 
mere  specification  of  energy  is  not  enough.  There  must  be  some  physical  quantity 
represented  by  an  operator,  the  eigenvectors  of  which  must  be  unique  linear 
combinations    of   these    solutions.^    The   eigenvectors   of  this   operator   must 


1  The  B  solutions  can  be  obtained  by  premultiplying  the  A  solutions  by  iy^  and  replacing 
m  by  -m.  This  is  generally  known  as  the  75,  —wi  invarianoe  of  the  Dirac  equation. 

2  R.  P.  Feynman,  Lectures  delivered  at  the  California  Institute  of  Technology,  on  "Quan- 
tum Electrodynamics"  (1953). 
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correspond  to  different  eigen  values  for  a  given  energy.  This  operator  must 
commute  with  H. 

In  the  rest  system  of  the  electron  the  positive  energy  solutions  u(\)A  and  u{2)A 
degenerate  to 


"1" 

"0" 

0 
0 

and 

1 
0 

_0_ 

_0_ 

"0" 

o"! 

0 

1 

and 

0 
0 

_0_ 

1 

and  the  negative  solutions  v(\)B  and  v(2)B  are 


The  first  two  components  of  the  positive  energy  solutions  are  the  eigenvectors 
of  the  0*2  operator  with  eigenvalues  +  1  and  —  1  respectively.  Similarly  the  last 
two  components  oi  v(\)B  and  v(2)B  are  eigenvectors  with  eigenvalues  ±1. 


Hence  the  spin  operator  a  should  be  where  c's  are  the  2x2  Pauli  spin 

operators. 

The  operator  a  •  p  commutes  with  H  and  any  eigenvector  with  components 
u^,  U2,  ^3  and  u^  such  that 


Px 


Pi 


is  an  eigenvector  of 


a  p 


P  —  Pz 


with  eigenvalue  +  1. 


Thus  neglecting  normalization  factors  the  combination 

{p^-ip^)u{l)A  +  {p-p,)u{2)A^u'{l) 


(73) 


is  an  eigenvector  of  both  H  and 


Op 


corresponding  to  the  eigenvalues  +  E 


and  +  1  respectively.  The  combination 

{p^-ipy)u{l)A  +  {-p-p,)u{2)A=u'{2) 


(74) 


corresponds  to  +^  and  — 1. 


a  'p 


is  the  projection  of  the  spin  in  the  direction 


of  motion.  The  specification  of  the  amplitudes  of  the  spin  in  and  opposite  to 
the  direction  of  momentum  therefore  determines  the  wave  function  correspond- 
ing to  +  E.  If  the  amplitudes  are  a  and  b  in  and  opposite  to  the  direction  of  motion, 
then  the  wave  function  corresponding  to  +  E  is  given  by 

au'{l)  +  bu'{2)  =  cu{l)A  +  du{2)A  (75) 

c  and  d  being  given  by 

c  =  {a  +  b){p^  —  ipy)',          d  =  a{p  —  p,)  —  b{p  +  p^)  (76) 

If  Uq  is  a  unit  vector  it  should  be  noted  that  the  operator  a  •  feo  '^^^  ^^^  commute 
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with  H  ii  \p\=¥  0  and  Icq  is  not  in  the  direction  of  p.  In  the  rest  system  of  the 
particle  the  wave  function  (75)  reduces  to 


(77) 


It  is  possible  to  find  a  spatial  vector  Uq  with  direction  cosines  (/q  ,  ttIq  ,  Uq)  such  that 

the  above  solution  is  an  eigenvector  of  a  -  kq  with  eigenvalue  +  1 .  This  implies 

c        Iq  —  irriQ 

,  i.e.  the  projection  of  spin  in  the  direction  [Iq,  Mq,  Hq)  is  +  1.  Note 


~E  +  m~ 

+  d 

~      0     " 

0 

E  +  m 

0 

0 

0 

0 

d         1  —  ^0 

that  (77)  is  not  an  eigenvector  of  o"  •  fe  if  fe  is  different  from  feg.  k  will  be  different 
from  ko  unless  a  =  I  and  6  =  0.  Summarizing  we  write  the  wave  function  of  the 
particle  with  momentum  p,  energy  E  and  with  the  direction  of  spin  in  the  rest 
system  defined  by  the  direction  cosines  {Iq  ,  ttiq  ,  Uq)  as 


{lQ  —  imQ)u{l)A  +  (1  -  nQ)u(2)A 


(78) 


Particular  cases 


(1)  In  the  case  when  [Iq,  Mq,  tIq)  is  coincident  with  ±z  axis,  the  above  method 
fails,  but  it  is  easy  to  see  that  the  wave  functions  in  these  cases  are  u{l)  A  and 
21  {2)  A  since,  operating  a^  on  the  wave  functions  in  the  rest  system  corresponding 
to  u{l)  A  and  u(2)  A,  we  obtain  the  eigenvalues  ±1,  i.e.  u(l)  A  and  u(2)  A 
are  solutions  corresponding  to  spin  along  ±  z  in  the  rest  system,  noting  of  course 
that  there  is  no  meaning  in  operating  a^  on  u{l)  A  and  u{2)  A  when  |^|  H=  0, 
i.e.  the  system  in  which  the  particle  is  moving. 

(2)  Expressing  u{l)  A  as  a  linear  combination  of  u'{l)  and  u' (2)  we  note  that 
n{\)  A  corresponds  to  spin  with  amplitudes 


1 


Pi-Pz 


P     Pa 


ip, 


in  and  opposite  the  directions  of  motion. 

(3)    If  P.-Py-  0, 


b  = 


p 


1 


P  —  Pz 


2p    p. 


Py 


(79) 


it  is  not  possible  to  obtain  ^'(1)  from  u(\)  A  and  u{2)  A  just  from  the  equations 
(73,  74).  But  it  is  easy  to  recognize  that  u' {I)  is  identical  with  u{l)  A  since  g^ 
operating  on  t^  (1)  ^  yields  +  1.  In  this  case,  the  spin  is  in  the  direction  of  momen- 
tum which  coincides  with  the  direction  of  motion. 

(4)  The  solutions  u(\)  A  and  u{2)  A  with  Pz=  ^  represent  a  particle  moving 
in  the  (x,  y)  plane  with  spin  in  the  ±z  direction  in  the  rest  system  of  the  particle. 

(5)  It  is  interesting  to  compare  u' [\)  and  u'{2).  u' {2)  can  be  obtained  by  just 
reversing  all  the  signs  of  momenta  in  u'{\).  This  implies  that  if  in  one  system 
of  reference  we  have  a  wave  function  u' {\)  and  we  observe  this  wave  function 
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in  a  system  in  which  the  momenta  are  reversed,  the  wave  function  in  the  new 
system  is  u*{2).  This  is  as  should  be  expected  since  spin  is  an  axial  vector  while  p 
is  polar  and  their  relative  orientations  get  reversed  under  inversion. 

In  analogy  with  light  we  can  introduce  the  concept  of  polarization.  A  particle 
is  longitudinally  polarized  if  its  spin  is  in  or  opposite  to  the  direction  of  motion. 
If  in  one  system  of  coordinates  the  momentum  is  in  the  positive  z  direction, 
since  the  velocity  of  the  particle  with  finite  mass  is  less  than  the  velocity  of  light 
it  is  possible  to  find  a  system  of  coordinates  in  which  the  direction  of  momentum 
is  in  the  —z  direction.  But  if  the  spin  is  in  the  +  z  direction  in  the  first  system  it 
is  still  in  the  +  z  direction  in  the  other.  If  the  longitudinal  polarization  is  such 
that  the  spin  and  momentum  are  in  the  same  direction,  we  call  the  particle  right- 
handed;  if  the  direction  of  momentum  and  spin  are  opposite  we  call  it  left-handed. 
The  concept  of  handedness  is  not  relativistically  invariant  for  a  particle  with 
finite  mass  since  for  a  particle  with  spin  in  the  direction  of  motion  in  one  frame  it 
is  possible  to  choose  a  frame  in  which  the  direction  of  spin  is  opposite  to  that  of 
momentum. 

In  collision  problems  where  we  have  a  free  incident  particle  which  is  scattered 
by  a  scattering  centre  and  moves  out  ultimately  as  a  free  outgoing  particle,  it 
is  customary  to  choose  the  incident  particle  to  have  its  Pz  ^  Py  ^  ^  but  Px^  ^ 
and  for  the  outgoing  particle  its  Pz  =  0  but  p^  ^  0,  py  ^  0,  i.e.  we  choose  the  plane 
of  the  directions  of  the  ingoing  and  outgoing  particles  as  the  xy-iplsme. 

We  will  conclude  this  dicussion  about  the  direction  of  spin  with  some  remarks 

O  'P 

on(  transverse  polarization.  We  have  stated  that  it  is  only  for  the  — | — r-  operator 

we  can  obtain  a  wave  function  which  yields  the  value  ±  1 .  If  we  are  interested 
in  the  component  of  spin  in  a  direction  k  not  coincident  with  p  we  can  only 
obtain  average  values  since  a  •  k  does  not  commute  with  the  Hamiltonian  if  k 
is  not  in  the  same  direction  as  p.  In  particular  if  jp  is  in  the  z-direction  we  can  ask 
for  the  average  value  of  the  operator  a^  or  Oy  and  if  this  average  value  is  non- 
zero we  can  say  the  particle  is  "transversely  polarized". 

Till  now  we  discussed  the  meaning  of  the  two  solutions  corresponding  to  the 
positive  value  of  energy.  We  now  turn  to  the  solutions  corresponding  to  negative 
energy.  As  we  had  mentioned  earlier  the  negative  energy  solutions  v{l)  A  and 
v{2)  A  obtained  by  replacing  J5J  by  — E  in  the  positive  energy  solution  vanish 
in  the  rest  system.  But  the  solutions  v(l)  B  and  v(2)  B  have  a  similar 
interpretation  as  u{\)  A  and  u{2)  A.  In  the  rest  system  they  correspond  to  spin 
along  the  i  ^  directions.  All  other  considerations  apply  to  these  negative  energy 
solutions.  Note  that  the  solutions  v{\)  B  and  v{2)  B  can  be  obtained  from  u{\)  A 
and  u{2)  A  by  operating  the  matrix  —  iy^y^, 
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on  them^.  The  physical  interpretation  of  the  negative  energy  solution  is  as 
follows : 

The  positive  energies  form  a  continuum  from  +m  to  oo  and  the  negative 
energies,  if  accepted,  form  another  continuum  from  — m  to  —  oo.  E  cannot  take 
a  value  between  +  m  and  — m.  The  meaning  of  the  negative  energies  is  at  first 
obscure  and  devoid  of  physical  content.  For,  as  we  shall  see  later,  one  of  the 
consequences  of  inclusion  of  negative  energy  states  is  that  if  a  particle  is  initially 
in  a  positive  energy  state  and  if  a  perturbation  is  applied  to  it  for  a  finite  duration 
then  there  is  a  finite  probability  for  the  particle  to  be  in  a  negative  energy  state. 
This  is  contrary  to  physical  fact  since  we  know  that  positive  energy  particles 
always  remain  in  positive  energy  states  even  after  a  perturbation  is  applied. 

To  overcome  this  difficulty,  Dirac  postulated  that  all  the  negative  energy  states 
are  normally  filled.  Thus  a  vacuum  is  a  state  when  all  the  negative  energy  states 
are  filled  with  electrons.  This  postulate  will  be  valid  and  useful  only  if  Pauli's 
exclusion  principle  is  invoked.  Once  Pauli's  exclusion  principle  is  assumed,  it  is 
clear  that  an  electron  in  a  positive  energy  state  cannot  make  a  transition  to  a 
negative  energy  state  if  these  states  are  full.  However  ifa  "hole"  is  produced  by  the 
transition  of  one  of  the  electrons  with  energy  — E,  momentum  p  and  charge  —  e 
(due  to  interaction  with  a  field)  to  a  positive  energy  state,  the  "hole"  manifests 
itself  as  a  positron  of  charge  +  e  momentum  — p  and  energy  +  E.  In  other  words, 
a  free  positron  of  momentum  — p  and  energy  +  E  has  the  same  wave  function 
as  a  free  electron  of  momentum  p  and  energy  — E,  i.e. 

u^{-p,E)  =  u_{p,-E).  (80) 

There  is  another  way  of  dealing  with  negative  energy  states  which  is  equivalen 
to  that  of  Dirac 's  hole  theory.  It  consists  in  extending  the  kernel  function  ap- 
proach to  the  Dirac  equation.  This  was  first  formulated  by  Feynman  and  since 
his  formulation  is  now  accepted  as  the  most  convenient  way  to  deal  with  electro- 
dynamic  processes  (and  meson  theory  to  some  extent),  we  shall  develop  the  per- 
turbation theory  for  Dirac 's  equation  in  Feynman's  approach. 

Before  dealing  with  perturbation  theory  we  shall  discuss  the  wave  equations 
of  other  elementary  particles  keeping  in  mind  that  the  only  valid  wave  equation 
known  is  the  Dirac  equation  for  the  electron.  However  it  is  found  useful  under 
restricted  conditions  to  represent  other  elementary  particles  through  wave  equations. 

Wave  equations  for  the  proton  and  the  neutron 

Protons  and  neutrons,  like  the  electrons  obey  the  Pauli  principle,  have  spin 
\  and  are  assumed  to  be  described  by  the  Dirac  equation.  On  this  assumption 
it  would  merely  imply  that  the  mass  of  the  electron  has  to  be  replaced  by  the  mass 
of  the  proton  or  the  neutron  in  the  equation.  One  of  the  main  objections  to  this 


^  In  the  non-relativistic  case  when  p  =  0,  E  =  m  it  can  be  verified  that  the  efifect  of  multi- 
plying the  Dirac  u  solutions  by  —  i>'4y5  i^  *^®  same  as  that  of  operating  iy^ .  Hence  it  is  usually 
stated  that  y-  connects  the  positive  and  negative  energy  solutions. 
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simple  extension  of  the  Dirac  theory  to  nucleons  is  that  it  yields  for  the  magnetic 
moment  of  the  proton  and^the  neutron  the  values  1  and  0  nuclear  magnetons 
while  the  observed  experimental  values  are  2.78  and  — 1.94  nuclear  magnetons ^ 
respectively.  One  of  the  important  consequences  of  the  Dirac  theory  is  the  pre- 
diction of  anti-protons  and  anti-neutrons.  As  in  the  hole  theory  of  the  electrons 
the  negative  energy  states  are  assumed  to  be  filled  and  as  a  result,  antiprotons 
can  be  produced  or  annihilated  only  along  with  protons.  Similar  considerations 
apply  to  anti-neutrons.  These  are  the  well  known  phenomena  of  pair  production 
and  pair  annihilation  which  have  been  first  observed  in  the  case  of  electrons. 
The  essential  difference  between  the  nucleons  and  the  electrons  lies  in  that 
there  exists  between  nucleons  strong  forces  of  very  short  range  of  non-electro- 
magnetic character.  It  is  postulated  that  nuclear  forces  are  independent  of  the 
charge  of  the  nucleon  and  are  the  same  between  any  two  nucleons  provided  their 
spin  and  orbital  momentum  states  are  the  same.  This  is  described  by  the  intro- 
duction of  the  concept  of  isotopic  spin  and  the  description  of  the  neutron  and 
proton  in  the  isotopic  spin  formalism.^  The  only  role  which  charge  plays  is  in 
determining  the  possible  states  as  are  permissible  by  the  Pauli  principle. 

Assuming  charge  independence  we  can  speak  of  only  one  particle,  the  nucleon, 
which  is  supposed  to  exist  in  one  of  two  states,  the  proton  and  the  neutron,  charac- 
terized by  a  charge  of  1  and  0.^  This  immediately  suggests  the  correspondence 
with  the  spin  formalism  where  a  particle  can  exist  in  one  of  the  two  spin  states 
J  and  — J.  In  analogy  with  this,  we  can  characterize  the  proton  state  by  a 
quantum  number 

Iz  =  i  (81) 

and  the  neutron  state  by 

Iz  =  -i-  (82) 

Though  we  do  not  attribute  to  this  quantum  number  any  dynamical  significance 
as  in  the  case  of  spin*  it  is  clear  that  it  must  be  linearly  related  to  the  charge  of 
the  particle.  It  is  introduced  merely  to  help  us  to  compute  the  number  of  per- 
missible states  of  a  system  of  nucleons.  The  charge,  in  units  of  e,  of  the  nucleon 
is  related  to  I^  by 

._  q  =  iz+h  (83) 

^  In  analogy  with  the  Bohr  magneton  the  nuclear  magneton  is  defined  as . 

2  B.  Cassen  and  E.  U.  Condon,  Phys.  Rev.  60,  846  (1936). 

^  H.  A.  Bethe  and  P.  de  Hoffmann,  Mesons  and  Fields,  Volume  II,  Row  Peterson, 
Evanston,  Illinois  (1955). 

*  The  analogy  between  spin  and  isotopic  spin  is  purely  formal.  See  Foldy,  Phys.  Rev.  93, 
1395  (1954).  In  the  case  of  spin,  though  it  is  possible  to  obtain  a  simultaneous  eigenf unction 
of  only  the  total  spui  and  its  2- component,  the  other  two  components  represent  the  same 
physical  quantity,  the  angular  momentum  in  the  corresponding  directions.  However  in  the  case 
of  isotopic  spin  it  is  only  the  z  component  that  is  related  to  charge  and  the  other  two  com- 
ponents have  no  physical  meaning.  As  regards  conservation  it  is  the  total,  i.e.  the  spin  together 
with  the  orbital  angular  momentum  that  is  conserved  in  a  quantum  mechanical  system. 
But  charge  Q  and  hence  I^  is  conserved  independent  of  /. 
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Thus  the  wave  function  of  a  nucleon  can  be  supposed  to  have  two  components  ^,^ 
and  Yjp  and  can  be  written  as  the  linear  combination  of  the  wave  functions  ?y^ 
and  rl,^  given  by 


Vp 


Vn- 


(83) 


In  our  discussion  on  spin,  we  have  emphasized  that  ±  J  are  the  possible  values 
of  the  2;-component  of  the  spin.  The  spin  matrix  a^  operating  upon  the 
wave  function  yields  the  eigenvalue  of  the  z-component  of  spin,  a^  and  Oy  are 
matrices  such  that  the  eigenvalue  of  the  operator  o% -\-  o^  +  gI  yields  the  total 
spin  momentum  of  the  particle. 

Having  introduced  the  two  component  formahsm  to  represent  the  state  of 
the  nucleon,  we  can  proceed  on  similar  lines  as  outlined  above.  We  introduce  the 
isotopic  spin  operator  Xz  which  yields  the  eigenvalue  /^  =  ±  J  and  the  operator 
can  be  represented  by  the  matrix  a^-  This  implies  that  we  can  also  conceive  of  the 
operators  Xr^,  Xy  represented  by  the  matrices  a^,  Oy.  By  analogy  with  spin 
the  eigenvalue  of  t|  +  t^  +  rf  is  /(/  +  1)  where  /  =  i  for  a  system  containing 
a  single  nucleon.  /  is  the  total  isotopic  spin  quantum  number  and  I^  the  quantum 
number  corresponding  to  the  z-component  of  the  isotopic  spin  which  is  related 
to  the  charge.  From  x^  and  Xy  we  can  form  x+  =  Kt^;  +  iXy)  and  t_  ==  ^{x^  —  ity) 
which  have  interesting  physical  interpretations 

T^+Vn  =  Vp  T^-Vp  =  Vn>  (^4) 

i.e.  x+  transforms  a  neutron  into  a  proton,  t_  transforms  a  proton  into  a  neutron. 
If  we  have  a  system  of  nucleons^,  the  2-component  of  the  total  isotopic  spin 
is  the  sum  of  the  z-components  of  each  particle : 

t^=iI;r^^.  (85) 

i 

Similarly  ty  and  t^  are  defined.  If  we  now  define  the  total  isotopic  spin  operator  t  by 

t'  =  tl  +  tl  +  tl  (86) 

in  complete  analogy  with  spin,  the  eigenvalue  of  ^^  is  given  by  7" (T+  1).  7^  being 
the  total  isotopic  spin  quantum  number  of  the  system  of  nucleons.  T  is 
obtained  by  the  usual  vector  addition  of  the  isotopic  spins  of  individual  nucleons 
each  of  which  has  magnitude  J  with  the  condition  that  the  composition  should 
be  such  that  it  is  non-negative.  Hence  T  has  integral  eigenvalues  if  the  number 
of  nucleons  A  is  even  while  T  is  half-integral  if  A  is  odd.  The  maximum  possible 
value  of  T  is  obviously  A 12. 

Since  Mrp  is  the  eigenvalue  of  the  z  component  of  t  and  T  is  the  quantum 
number  of  total  isotopic  spin,  we  have  for  any  given  T 

Mt=  T,T-l,  .  .  .,—T.  (87) 

^  We  use  the  symbol  t  for  the  isotopic  spin  operator  of  a  system  of  particles  and  t  for 
a  single  particle,  their  eigenvalues  being  denoted  by  T  and  /  respectively. 
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The  charge  of  the  system  is  related  to  Mrp  by  the  relation  Z  =  M^  +  \A.  Thus 
the  charge  of  a  nucleus  defines  only  if  y  and  not  T.  Conversely  if  ^  and  Z  (hence  M^) 
are  given,  the  T  values  of  the  nucleus  are 


\Mr, 


\Z-\A\,  \Mt\+\,...,\A 


(88) 


To  fix  our  ideas,  let  us  consider  the  case  of  a  two-nucleon  system :  the  f  —  f, 
n  —  n  and  ^  —  n  systems.  We  list  below  the  M>]^  and  T  values  of  these  systems  and 
also  their  charge  wave  functions  with  their  symmetries. 


System 

M  q, 

T 

Charge  wave  function 

Symmetry 

of  charge 

wave  function 

v-v 

1 

1 

xxA^n'i'nf 

S 

n-n 

-1 

1 

Xi,-i- V^^'rif 

S 

p-n 

0 

1 

Ao-  y2    [<^«^+#^^''] 

s 

p-n 

0 

0 

Zo.o-  ^2    IV?€^-<V'^'] 

A 

Of  these,  the  first  three  are  charge  symmetric  with  respect  to  interchange  of 
particles  1  and  2  whereas  the  last  changes  sign.  Now  the  Pauli  principle  states 
the  wave  function  must  be  anti-symmetric  under  the  exchange  of  all  coordinates,  i.e. 
space,  spin  and  isotopic  spin.  Thus  if  the  charge  wave  function  is  one  of  the  first 
three  listed,  the  space-spin  wave  function  is  anti- symmetric  while  it  is  symmetric 
if  the  charge  wave  function  is  of  the  fourth  type. 

The  postulate  of  charge  independence  of  nuclear  forces  states  that  the  forces 
do  not  depend  upon  the  charge  of  the  nucleons  provided  the  space-spin  state  of 
the  system  is  the  same.  Therefore,  the  energy  levels,  scattering  phase  shifts  etc. 
are  the  same  for  states  involving  the  same  space-spin  state  multiplied  by  any  of 
the  first  three  charge  wave  functions  ;fi  ^ ,  ;fi,_i,  Xi,o-  The  last  one  ;fo,o  i^  anti- 
symmetric in  the  charge  wave  function  and  has  to  be  multiplied  by  a  symmetric 
space -spin  wave  function  and  hence  it  will  behave  quite  differently.  Thus  we  see 
that  the  p-n  system  with  symmetric  charge  wave  function  behaves  like  p-p 
and  n-n  systems  and  quite  differently  from  p-n  system  with  anti-symmetric 
charge  wave  function.  Hence  the  behaviour  of  the  system  is  determined  by  the 
symmetry  of  the  charge  wave  function.  We  also  note  that  the  symmetric  charge 
wave  functions  ;fi,i,  Xho^  Xi, -i  bave  the  same  isotopic  spin  1  and  form  a  triplet 
in  charge  space  with  iify  =  1,0, — 1  respectively.  The  anti- symmetric  charge 
wave  function  ;^o.o  bas  the  total  isotopic  spin  0.  Thus  we  arrive  at  the  im- 
portant result  that  the  behaviour  of  the  system  is  determined  only  by  T  and 
not  by  Mrp. 
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Wave  equation  for  the  meson 

The  Klein-Gordon  equation  was  obtained  merely  by  imposing  the  relativistic 
requirements  on  a  particle  described  by  a  one  component  wave  function,  i.e. 
a  particle  of  zero  spin.  The  neutral  meson  is  one  such  particle.  Historically  the 
meson  was  predicted  by  Yukawa^  in  attempting  to  understand  the  nature  of 
the  nuclear  forces.  Since  by  a  nucleon  is  meant  both  a  neutron  and  a  proton, 
Yukawa's  theory  included  positive,  negative  and  neutral  mesons.  Thus  it  became 
necessary  to  introduce  the  isotopic  spin  formalism  into  the  Klein-Gordon  equation. 

Since  field  theory  (to  be  discussed  later)  is  almost  as  old  as  wave  mechanics, 
Yukawa's  original  paper  was  written  in  field  theoretic  formalism.  In  view  of  the 
"epoch  making  idea  that  the  introduction  of  the  ever  unknown  new  field  enables 
us  to  account  for  the  nature  of  the  nuclear  forces"  we  shall  summarize  the  results 
of  the  celebrated  paper  in  wave  mechanical  formalism  by  a  suitable  adaptation 
to  fit  in  with  our  discussion  of  the  wave  functions  of  single  particles. 

Yukawa  introduced  a  field  of  force  to  describe  the  interaction  between  the 
nucleons  in  close  analogy  with  the  electromagnetic  field  which  describes  the 
interaction  between  charged  particles.  The  scalar  potential  of  the  electromagnetic 
field  satisfies  the  equation 

Substituting  U  =  V{r)  e^'"'  we  have 

{P2+ a;2}  7=  0.  (90) 

This  has  a  solution  V (r)  =-  —  e*"*^  where  r  =  (x^  +  y^  +  z'^fl^  Leaving  aside  the 

oscillatory  part,  ^  is  the  spherically  symmetric  static  solution  for  the  potential, 

valid  for  all  r  except  r  =  0.  The  potential  between  the  nucleons  can  be  defined 
in  analogy  with  this  static  potential;  however  it  should  not  be  of  the  Coulomb 
type  but  should  decrease  more  rapidly  with  distance.  Yukawa  required  a  potential 
of  the  type 

±^2___  (90a) 

where  g  has  the  dimensions  of  charge  and  jll  that  of  inverse  length.  This  function 
(90a)  is  a  static  solution  in  the  same  sense  as  above  of  the  equation 


[F'—^-^'^U{x,y,z,t)  =  0 


(91) 


with  ju  given  by  /u^  =  ?.^  —  co^  provided  Jl>|co|.  However  when  A<|ft)|,  /u 
will  be  purely  imaginary.  The  corresponding  solution  is  a  spherically  symmetric 
wave  as  in  the  case  of  the  electromagnetic  field.  In  such  a  case  (91)  can  be  treated 


1  H.  Yukawa,  Proc.  Phys.  Math.  Soc,  Japan  17,  48  (1935). 
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as  a  free  particle  equation  as  in  the  case  of  the  photon  field,  the  source  being  taken 

to  be  at  an  infinite  distance.  The  static  solution  of  the  inhomogeneous  equation 

(P2_^2)  C7(a;,  y,z)=^-  ingfix,  y,  z)  (92) 

is  obtained  according  to  the  standard  theory  of  partial  differential  equations  as 


/ 


gfix'y'z')"    _^^.  d»/  (93) 


where  gf(x,y,z)  is  interpreted  as  the  "source  distribution  function"  of  the 
potential.  If  we  have  a  point  source  at  r'  =  0,  we  have  merely  to  write 

.     fix',y^,z')  =  d{x')d{y')S{z')  (94) 

and  the  solution  reduces  to  g — .  If  U  were  to  represent  the  meson  field  the 

source  function  should  be  the  nucleon  density.  If  we  wish  to  envisage  positive, 
negative  and  neutral  mesons,  the  isotopic  spin  formalism  should  be  suitably 
imbedded  in  the  nucleon  density.  Ifip  is  the  two-component  nucleon  wave  function 
(i.e.  in  isotopic  spin  formalism)  we  can  convince  ourselves  that  the  source  function 
for  the  positive  meson  potential  should  be 

4:7igipr^W  (95) 

which  merely  expresses  that  the  proton  wave  function  is  taken  over  to  the  neutron 
state.  The  corresponding  source  functions  for  the  negative  and  neutral  mesons 
are 

4:ngy}r+yj,4:7igfr^yj  (96) 

Thus  we  have  as  on  the  right  hand  side 


F2____p|  Ui  =  -47igy^Tiy^  (97) 

i  =  — ,  + ,  3  corresponding  to  positive,  negative  and  neutral  mesons.  Since  t_  =  t+ 
the  wave  function  U_  foy  the  negative  meson  is  U+  .  This  is  in  conformity  with 
wave  mechanics  where  the  charge  conjugate  solution  is  obtained  simply  by 
complex  conjugation.  For  the  neutral  meson  Tg  =  Tg  and  obviously  C/g  =^  U^. 

We  disregard  for  the  moment  that  the  nucleon  has  spin.  Since  no  correct 
relativistic  theory  exists  for  heavy  particles  we  assume  that  the  simple  non- 
relativistic^  equation  for  ip  neglecting  spin  can  be  written  as 


II  4  (    mJ 


M^  r  ^'  dt      2    ^      2    '-"^ 


+  g{U_r.+  U^r_) 


^  =  0  (98) 


1  It  is  to  be  noted  that  by  non-relativistic,  we  only  mean  that  the  velocity  of  the  nucleon 
is  negligible  but  we  take  the  relativistic  expression  for  the  energy. 
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corresponding  to  the  Hamiltonian 
-T^s    ,    1  +  T; 


H  = 


4M„  4:M 


f  +  ^^-  -Mn  +  ^-^'  M^^g{ U_  T_  +  U,T,).      (99) 


Each  term  of  the  Hamiltonian  is  self-explanatory.  Consider  now  a  transition 
from  a  proton  state  of  energy  w^  to  a  neutron  state  of  energy  co^  •  These  states 
can  be  expressed  as 

Substituting  in  (97)  we  have 

F2  — A__p|  U^  =  —4.nguve-''^''v-"'n\  (101) 

Put   U+  =  U'^{x,y,z)e*^"'^  where  m  =  {a).,^  —  cOp).  We  have 

{i^^  —  {X^  —  co^)}U',  =  —4.7iguv.  (102) 

Integrating,  o-H-'^l 

where  as  before  we  have  set  /j,  =  )/A^  —  co^. 

(i)  When  A > co,  ^  is  real  and  depends  on | co„  —  cOp\.  This  shows  that  the  range 
of  the  interaction  between  a  neutron  and  a  proton  increases  with  |  co^^  —  cOp\, 

(ii)  When  A  <  |  ft>  |,  //is  imaginary  and  we  interpret  this  wave  as  that  corre- 
sponding to  the  quanta  of  the  field.  This  implies  that  a  quantum  of  energy  greater 
than  ?i  can  be  emitted  in  a  proton  to  neutron  transition  provided  |a>^  — cOp]  >  A. 

One  simple  question  can  be  raised  in  regard  to  our  analogy  between  the  electro- 
magnetic and  the  meson  fields.  In  the  meson  field  we  have  made  the  statement 
that  if  ^2  <^  (^2  ^  becomes  imaginary  and  therefore  we  do  not  get  the  potential 
solutions.  But  A  =  0  should  correspond  to  the  photon  field  and  in  this  case  though  jlc 

is  imaginary  we  interpreted  —  as  the  static  solution.  This  interpretation  should  not 

be  carried  to  the  case  of  the  meson  field  when  ju  is  imaginary  as  the  physical 
situation  is  different  from  the  photon  field.  When  enough  energy  is  given  to  the 
nucleon  system,  i.e.  if  co^  >  A^,  the  meson  goes  well  out  of  the  very  short  range 
of  the  potential  field  and  can  therefore  be  considered  free.  In  the  electromagnetic 
case  the  situation  is  much  more  complex  and  the  static  and  radiation  fields  have 
to  be  considered  together  in  any  description  since  the  static  Coulomb  field  has 
infinite  range. 

Wave  equation  for  the  neutrino 

It  will  be  interesting  to  study  the  Dirac  equation  in  the  limit  as  the  mass  m 
tends  to  zero.  For  this  purpose  it  is  convenient  to  go  back  to  the  derivation  of 
the  Dirac  equation  through  the  relativistic  generalization  of  the  Pauli  equation.  We 
have       ,  ,  ^ 

^  where     71^  =  1— -  —  eA^;         ct  =  p  —  eA        (109) 

3* 


36  ELEMENTARY  PARTICLES  AND  COSMIC  RAYS 


function  was  identified  to  be 


and  (p  and  ^  are  two  component  wave  functions.  The  four  component  Dirac  wave 

(p  -\-  ')(^ 

.  If  we  let  m->0  we  notice  that  w  and  y 

are  uncoupled  and  should  represent  the  wave  equation  of  particles  with  spin  J 
and  mass  zero.  Such  a  particle  had  indeed  been  postulated  by  Pauli  in  order  to 
explain  the  continuous  spectrum  of  p  particles  emitted  by  radioactive  nuclei. 
It  was  a  theoretical  necessity  that  such  a  particle,  called  the  neutrino  should 
have  mass  zero,  spin  half  and  neutral  charge,  for  energy,  angular  momentum  and 
charge  conservations  respectively. 

The  limit  m  ^-  0  in  the  above  equation  gives  rise  to  an  interesting  feature  not 
present  in  the  Dirac  case.  If  mass  is  finite  as  in  the  Dirac  case,  an  eigenfunction 
oi  <j  '  p  need  not  be  an  eigenfunction  of  the  Hamiltonian  H,  i.e.  it  can  be  a  linear 
combination  of  eigenfunctions  of  H  corresponding  to  positive  and  negative 
energies.  Thus  we  can  have  electrons  in  positive  and  negative  energy  states 
having  their  spins  both  parallel  or  antiparallel  to  their  momenta.  If  we  set  m 
equal  to  zero  then^^x^T  •  p  and  the  eigenfunctions  oi  O  ■  p  corresponding  to  the 
two  eigenvalues  i  1  (spin  parallel  or  antiparallel  to  the  momentum,  i.e.  positive  or 
negative  helicity)  are  also  the  eigenfunctions  of  the  Hamiltonian  with  positive 
and  negative  eigenvalues  of  energy.  But  still  the  question  remains  as  to  what 
helicity  should  be  attributed  to  the  positive  and  negative  energies  respectively. 
In  analogy  with  the  Dirac  hole  theory  if  the  antineutrino  is  to  be  identified  as 
a,  hole  in  the  negative  energy  sea  of  neutrinos,  the  antineutrino  has  the  same 
helicity  as  the  negative  energy  neutrino  or  opposite  to  that  of  the  neutrino. 
Experiments  have  decided  in  favour  of  negative  helicity  for  the  neutrino  and 
positive  helicity  for  the  anti-neutrino. 

A  positive  energy  neutrino  with  spin  anti-parallel  to  the  momentum  under 
spatial  reflection  would  have  spin  parallel  to  momentum  which  is  a  forbidden  state 
for  a  neutrino  with  positive  energy.  This  is  precisely  the  reason  why  the  two 
component  theory  of  the  neutrino,  first  discussed  by  WeyP  was  rejected. 
Since  it  has  been  found  that  neutrinos  are  produced  in  weak  interactions  which 
do  not  conserve  parity,  the  possible  use  of  the  two  component  theory  was  pro- 
posed and  considered  by  Lee  and  Yang,^  Landau^  and  Salam.*  It  is  also  seen 
that  the  theory  is  not  invariant  under  charge  conjugation. 

The  two  component  formalism  can  also  be  derived  .directly  from  the  four 
component  Dirac  equation  as  follows : 

Consider  the  Dirac  equation 


i« 


-iy^—--^m]tp  =  0.  (110) 


1  H.Weyl,  Z.  f.  Phys.  56,  330  (1929). 

2  T.  D.  Lee  and  C.  N.  Yang,  Elementary  Particles  and  Weak  Interactions,  Office  of  Tech- 
nical Services,  Washington,  D.  C,  25. 

3  L.  Landau,  Nucl.  Phys.  3,  127  (1957). 

4  A.  Salam,  Nuovo  Cim.  5,  229  (1957). 
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where  C  is  defined  through  C  ^y^fi 
The  4-component  \p 


The  charge  conjugate  of  this  equation  is 

(-*>.^  +  ^)^V^^  =  0,  (111) 

rj,  c^  =  -  c,  ct  =  c-\ 

is  a  solution  of  the  Dirac  equation  of  first  order 

.<p-x\ 

but  y^ip  is  not  a  solution  unless  m  =  0.  But  if  we  take  the  second  order  form  of 
the  Dirac  equation  , 

then  y^ipi)  is  also  a  solution  since  y^  commutes  with  the  operator  on  the  left-hand 
side.  Defining 

we  note  that  ^+  and  ip_  are  not  solutions  of  the  Dirac  equation  since  (1  +  y^) 
does  not  commute  with  the  Dirac  Hamiltonian.  xp+  and  ip_  are  charge  conjugate 
solutions  and  they  satisfy 

a 


-lyt  - — \p^-^m^_ 


0 


(114) 


iy'. 


dx. 


y)_-^mtp^  =  0. 


\p+  and  xp_  can  be  identified  in  terms  of  (p  and  x  ^s 


W, 


and 


V- 


X 
—X 


(115) 


\p+  and  \p_  have  eigenvalues  ±  1  with  respect  to  y^  and  are  known  as  the 
positive  and  negative  chiral  states  respectively,  the  eigenvalue  being  termed 
the  chirality  of  the  state.  Thus  99  and  x  uiight  as  well  be  chosen  as  the  wave 
functions  of  the  neutrino  and  the  anti-neutrino  respectively. 

The  covariance  of  these  equations  under  proper  Lorentz  transformations  is 
easily  demonstrated  by  defining  the  transformation  matrix  S  which  connects 
the  wave  function  99  to  its  transform  cp' ,  i.e. 

(p'{x')=^  S(p(x).  (116) 

For  a  rotation  of  the  axis  through  d  around  the  spatial  direction  q, 
S=  exp{^id6 '  q).  For  a  Lorentz  transformation  corresponding  to  a  velocity  V 
in  the  direction  q,  S  ^  exp{^Va  •  q).  All  other  proper  transformations  can  be 
generated  from  these  two  fundamental  transformations. 

When  we  consider  discrete  transformations  such  as  space  reflection,  we  find 
that  the  equation  (114)  is  no  longer  co variant  in  the  sense  that  we  are  unable  to 
find  an  S  such  that  (p'{x')  =  <^(p{x).  For  in  order  to  have  covariance  we  need  for 
the  operator  a 


So8-^  = 


(117) 
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However  a  and  — a  satisfy  different  commutation  relations.  Hence  no  unitary 
transformation  is  possible.  However  if  we  modify  the  definition  of  co variance 
under  spatial  reflection  to  imply 

(p'{x')^-- S(p*(x)  (118) 

then  it  is  possible^  to  find  an  S  which  satisfies  such  a  relation.  Since  a*  and  ai 
have  the  same  commutation  relations  it  is  possible  to  find  a  matrix  A  such  that 

AofA-^  =  ~Oi.  (119) 

From  the  properties  of  o  matrices  it  follows  that 

A  =  A-^==A*  (120) 

JS  can  be  now  identified  as 

S=eA  (121) 

where  s  is  imaginary.  The  purpose  of  using  the  auxiliary  matrix  A  is  to  draw 
attention  to  the  fact  that  in  the  case  of  time  reflection  8  can  be  written  as 

S^  f^A'  ^  (122) 

where  jn  is  real. 

It  is  interesting  to  note  that  this  modified  definition  is  the  one  used  to  prove 

the  covariance  of  Maxwell's  equations  under  spatial  reflection.  Thus  we  can 

conclude  from  this  point  of  view  that  equation  (114)  is  co  variant  under  the  full 

Lorentz  group.  In  other  words,  the  presence  or  absence  of  the  mass  term  does 

not  affect  the  covariance  of  the  free  Dirac  equation  and  hence  also  the  interactions 

formed  from  such  wave  functions. 

3.  WAVE  MECHANICS   OF  THE  PHOTON 

Wave  equation  for  the  photon  ^ 

Our  discussion  of  single  particle  equations  will  be  complete  only  if  we  include 
the  photon  as  an  elementary  particle  of  mass  zero  and  discuss  its  equation  of 
motion  from  the  point  of  view  of  wave  functions.  While  in  the  case  of  particles 
like  the  electron  the  wave  function  is  postulated,  in  the  case  of  the  photon  the 
electromagnetic  field  associated  with  it  is  already  represented  in  configuration 
space  through  the  Maxwell  equations.  Thus  if  the  photon  were  to  be  described 
through  a  Schrodinger  equation  the  latter  should  be  related  to  the  Maxwell 
equations.  It  turns  out  that  despite  the  fact  that  the  electromagnetic  field 
variables  E  and  H,  or  equivalently  A  and  A^  are  functions  of  space  and  time 
there  is  no  meaning  in  speaking  of  a  wave  function  of  a  photon  in  configuration 
space^.  However  we  can  by  taking  a  suitable  combination  of  the  momentum 


1  K.  M.  Case,  Phys.  Rev.  107,  307  (1957). 

2  This  section  is  based   on  Quantum  Electrodynamics,   Part  I,  by  A.  I.  Akhiezer  and 
V.  B.  Berestetsky  (English  translation)  Office  of  Technical  Services,  Washington,  D.C.,  25. 

^  That  is  if  we  require  a  probability  interpretation  as  defined  in  chapter  I. 
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transforms  of  E  and  H  obtain  a  wave  function  for  the  photon  in  momentum 
representation  which  satisfies  a  Schrodinger  equation.  Thus  we  start  by  writing 
the  Maxwell  equations  in  momentum  space  and  derive  the  Schrodinger  equation 
of  the  photon  therefrom.  In  this  procedure  we  realise  that  the  very  familiar 
Maxwell  field  has  a  complicated  structure  and  the  wave  function  formalism  of 
the  photon  has  peculiar  features  not  met  with  in  the  case  of  other  elementary 
particles.  We  realise  that  the  quantum  mechanics  of  the  photon  should  necessarily 
be  relativistic  ab  initio.  We  shall  first  study  the  photon,  i.e.  the  electromagnetic 
field  in  the  absence  of  charges.  So  we  take  the  Lorentz -invariant  free-field  Maxwell 
equations  and  obtain  the  wave  function  for  the  photon. 

Schrodinger  equation  of  the  photon  in  momentum  space 

The  Maxwell  field  is  completely  specified  by  the  distribution  of  the  electric 
and  magnetic  vectors  in  space  and  time.  These  vectors  E{x,  t)  and  H(x,  t)  satisfy 
the  coupled  equations 

cur\E= divif=0 

dt 

3E  ^^^ 

curlH=-— -  divE  =  0. 

ct 

Since  the  charge  density  q  is  assumed  to  be  zero  everywhere  (i.e.  div  J5=  0)  we 
tare  dealing  with  a  radiation  field  and  thus  only  with  photons. 

Taking  the  Fourier  transforms^  of  E(x)  andH(ap)  sis  E{k)  and  jar (fe)  defined  by 

E{k)=:-^jE{x)e-^''-d^x;  H{k)=^-^JH{x)e-^^-A^x,  (2) 

we  can  write  the  Maxwell's  equations  in  momentum  space  as 

H{k)  =  -i[k  X  E{k)],     E{k)  =  i[k  x  H{k)] 
k'E{k)  =  0,  k'H{k)^0.  (3) 

ilk  X  Elk)^ 
Further  we  see  that  H{k)  ^ — and  the  reality  condition  on  E  and  H 

imposes  the  relationships 

E(—k)  =  E*{k)    and    H{—k)  =  H*{k)^  (4) 

To  arrive  at  the  wave  equation  we  now  define  a  function  f{k)  by: 
E{k)  =  N{k)[f{k)+f*{-k)] 
E{k)  =  -ikN{k)  [f{k)-f*{-k)] 


^  In  defining  these  transforms  we  are  adopting  natural  units.  Otherwise  we  should  use 
kjh  instead  of  k  and  here  the  Fourier  transform  is  defined  with  a  factor  (27r)^  unlike  those 
in  chapter  I,  sec.  I.  k  being  used  for  \k\  where  convenient. 
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where  N{k)  the  normalizing  factor  turns  out  to  be  equal  to  ]/kl4:7t^l^  and  the 
function  /  satisfies  the  transversality  condition: 

,k'fik)  =  0  (6) 

Substituting  for  E{k)  and  E{k)  in  terms  of /(fe)  in  the  Maxwell's  equation 

-^  +  k''^E{k)  =  0,  (7) 

we  obtain  the  following  equations  for  /(fe)  and  its  conjugate 

i^=>cfm,     -.^=.rw  (8) 

Since  k  is  the  energy  of  the  photon  in  our  chosen  unit,  the  above  has  the  form 
of  ^^e  Schrodinger  equation  for  the  photon  in  momentum  space  if  the  energy  operators 
of  the  photon  in  fe  space  is  defined  as  the  operator  of  multipUcationby  the  wave 
number  k.  Thus  we  identify  f[k)  to  be  the  wave  function  of  the  photon  in  mo- 
mentum space.  To  verify  this  we  show  that  the  energy  defined  through  the  electric 
and  magnetic  fields  is  identical  with  that  derived  from  the  Schrodinger  equation. 

Energy  of  the  electromagnetic  field 

The  energy  of  the  electromagnetic  field  is  given  by  the  usual  expression  quadratic 
in  E  and  //as 

cd  =  ^j{E^  +  m)^^x.  (9) 

Using  the  "wave  function"  of  the  photon  we  can  calculate  the  average  value 
of  the  energy  as 

cd=ff*(k)cof{k)d^k.  (10) 

To  show  the  equivalence  of  equations  (9)  and  (10)  we  write  field  vectors  in  terms 
of  their  momentum  transforms : 

CO  =  \j[E{k)  E{k')  +  H(k)  H(k')]  e^^'^  +  '^'^'^d^kd^k'  d^x.  .(11) 

This  when  expressed  in  terms  of  /(fe)  and  /*  (fe)  reduces  to 

M  -  l6  7i^fN^f*(k)f(k)  d^k  (12) 

and  since  N  has  been  chosen  as  yk/^n^l^  we  see  that 

CO  =//*(fe)  kf(k)d^k  =ff*{k)cof(k)  d^k.  (13) 

If  we  take  a  monochromatic  solution  of  equation  (8) 

/(fe)=/o(fe)e-^^  (14) 
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corresponding  to  one  of  the  energy  eigenvalues  k  =  p  for  the  co  operator  we 
find  that  co  =  pjf*{k)f{k)d^k  =  p  ii  the  wave  function  /  satisfies  the  normali- 
zation condition^  ^ 

ff*(k)f{k)d^k  =  l.  (15) 

Momentum  of  the  photon  field 

The, momentum  of  the  electromagnetic  field  in  terms  of  E  and  H  is  given  by 
the  integral 

p==j[E  X  Hjd^x,  (16) 

This  integral  on  substitution  in  terms  of  the  functions  f(k)  is  found  to  be 

(27r)3./.V2.{/*(fe)/(fe)-/*(-fe)/(-fe)+/*(-fe)/*(fe) 

-fik)fi-k)}~d'k  (17) 

p=Jf*(k)kf{k)d^k.  (18) 

Thus  the  momentum  of  the  field  is  given  as  the  quantum  mechanical  average 
of  the  photon  momentum  operator  k. 

Wave  function  in  configuration  space 

Though  it  may  appear  possible  for  us  to  go  over  to  the  wave  function  of  the 
photon  in  configuration  space  by  taking/(ap)  the  Fourier  inverse  of  the  function/(fe), 
we  realise  that  such  a  representation  is  not  possilbe. 

Due  to  the  normalization  condition  on  f{k)  we  know  that  f*{k)f{k)  is  the 
probability  density  of  the  photon  in  fe-space.  But  the  normalization  condition 
in  configuration  space 

ff*{x) fix)  d^x=l  (19) 

cannot  lead  to  the  interpretation  that  /*  (x)  f(x)  is  the  probability  of  finding 
a  photon  at  a?.  This  is  because  the  presence  of  the  photon  is  detected  by  its  inter- 
action with  charges  which  is  determined  by  the  values  of  E{x)  and  H{x)  at  a 
given  point.  From  equation  (5)  we  see  that  due  to  the  presence  of  ^k  in  N,  E(x) 
and  f(x)  are  not  connected  by  a  local  hut  by  an  integral  relationship.  Hence  the 
probability  density  for  a  photon  to  be  at  a  point  in  space  does  not  exist. 

Plane  wave  solutions 

The  photon  wave  function  f{k)  in  momentum  space  satisfies  the  Schrodinger 
equation  (8)  and  transversality  condition  (6).  Considering  a  single  photon 
with  a  definite  momentum  k  =  p,  we  write  the  photon  wave  function  as 

f        fee--'   g[fc  +  P  +  ^P]g[fe-(P-^P)]  f20» 

^  The  normalization  implies  that  we  are  interpreting  the  electromagnetic  field  as  being 
associated  with  one  photon,  f{k)  being  the  amplitude  for  the  photon  having  a  momentum  fe. 
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i;ei,= 

-0;6,, 

=  0 
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where  e^  is  the  polarization  vector  with  modulus  unity  and  is  perpendicular  to 
the  momentum  vector  p  and  H{x)  is  the  Heaviside  unit  function  defined  by 

H{x)  =  lfoTx>0  (21) 

=  Ofor  a;<0. 

The  product  of  the  Heaviside  functions  is  chosen  to  express  the  fact  that  the 
photon  momentum  lies  between  p  and  p  +  Ap.  The  normaUzation  condition  of 
a  single  photon  in  the  momentum  range  Ap  necessitates  the  term  {Apy^- 
in  the  denominator.  For  a  given  p,  two  linear  polarizations  or  two  circular  polari- 
zations are  possible,  denoted  by  the  values  1,  2  for  the  index  /u.  Taking  p  to  be 
along  the  2 -axis  they  can  be  represented  by  either 

linear  polarization.  (22) 

=  0 

■  circular  polarization .  (23) 


Thus  in  either  case  Ci  •  62  =  0  and  the  /^^,  solutions  form  a  complete  orthonormal 
set  for  the  given  set  of  photon  quantum  numbers  Px,  Py,  Pz  ^^^  1^  Any  arbitrary 
function  f{k)  can  be  expanded  in  terms  of  this  set  fp^^ 

f(k)=  yjc^"'fpf.  '(24) 

p/j 

where  c^'' =jff*f,d^k . 

Angular  momentum  and  spin  of  the  photon 

In  a  manner  similar  to  the  above  we  proceed  to  identify  the  expectation  value 
of  the  photon  angular  momentum  in  the  state  /(fe)  with  the  angular  momentum 
of  the  field. 

The  angular  momentum  in  terms  of  E  and  H  of  the  field  is 

M=fx  x{ExH)d^x.  (25) 

In  terms  of  the  transforms  E{k)  and  H{k)  this  becomes 

M=j  XX  [E(k)  X  H{k')]  e'('''^''">>d^kd^k' d^x.  (26) 

Substituting  in  terms  of  f{k)  and  after  considerable  calculation  we  arrive  at 

M=ld^k{-i[kxV{f^'f)]-i[f*xn}  (27) 

Where  the  index  c  indicates  that  in  performing  the  dififerentiation  fc  is  treated 
as  a  constant  or  its  components 

,%=ld^k{f$[-ikxP]J,-ie^SynU)  (28) 
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where  e^^y  is  the  anti-symmetric  unit  tensor  of  rank  three.  Defining  an  operator 
8  such  that 

Kip--i^.Pyty  (29) 

whereby  any  cross  product  \cf  x  /]  can  be  written  as 

[y  ■=<!].  =  g(is,)f  (30) 

we  can  write  the  angular  momentum  of  the  field  as 

M^jjti-ihxf  +  S]f,dk  (31) 

Thus  we  see  that  the  quantum  mechanical  average  of  the  operator  [ — ikxV  IS] 
yields  the  angular  momentum  of  the  photon  field. 

It  is  shown  later  that  the  angular  momentum  operator 

M=^iJ=  -i[kxP]  +  S  (32) 

corresponds  to  an  infinitesimal  rotation  operator  J  of  a  vector  field.  This  operator  3/ 
can  be  shown  to  obey^  the  commutation  relations 

M^  if  „  -  My  M..  =  i  M. 

(33) 
M^M^-M^M,,  =  0  ^     ^ 

and  since  M^  commutes  with  the  energy  operator  and  has  eigenvalues  j  {j  -\-  1 ) 
and  since  M^  can  be  shown  to  have  (2/4-1)  eigenvalues  ranging  from  — j  to  j, 
the  photon  states  with  definite  values  of  oj,  M^,  and  M^  exist  with  corresponding 
quantum  numbers  p,  j  and  M.  We  shall  obtain  the  wave  functions  of  these  states. 

Spin  wave  functions 

From  the  above  discussion  we  see  that  a  rotation  operator  for  a  vector  field 
involves  two  terms,  one  of  the  type 

L=-i[kxF]  (34) 

called  the  angular  momentum  operator  (in  momentum  representation)  and  a 
spin  operator  S  defined  by  (29).  Hence  the  photon  wave  function  should  be 
dependent  on  k  its  momentum  and  another  spin  variable  oc ;  L  operates  only  on 
the  k  space  and  S  only  on  the  spin  space. 

It  can  be  seen  that  the  operator  L^  has  eigenfunctions  corresponding  to  eigen- 
values l{l  +  1)  and  the  operator  L^  has  (2Z  +  1)  eigenvalues,  i.e. 

L'nn.-W+^)^l,m  (35) 

L.(pi,m=^'^ffl,m  (36) 

where  (pi„^  are  the  usual  spherical  functions 

nm  =  ci{k)Y\«l:     n  =  ^  (37) 
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a(k)  and  y^^  being  normalised  suitably.  That  is 

fa* {k)a{k)k^dk=l     and     /y*^«)  yj")  d  0  =  6i,  S,„„. 


(38) 


The  operator  8  consists  of  3  x  3  matrices  with  corresponding  eigenfunctions 
Xsfii^)  which  satisfy  the  equations 

^^''Xs.u-s{s+l)Xs,c  '  (39) 

^zZsfi  =  f^XsM  (40) 

From  the  form  of  the  spin  matrices  we  realise  that  a  vector  field  is  a  spin  1  field 
with  eigenvalues  s  equal  to  1  and  //  =-  0,  ±  1.  The  spin  eigenfunctions  /q,  ^^^ 
are  also  given  below 


(41) 
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(42) 


The  spin  eigenfunctions  Xii~  ^  ^^^  orthogonal  and  they  form  a  basic  set  in  terms 
of  which  any  vector  /  can  be  resolved 

f=Si,t„  (43) 


I.e. 


h-fz\       f±-^p[L^ify^ 


(44) 


We  may  choose  the  components  of  the  polarization  vector  e  to  be  identical 
with  ^^.  The  two  possible  polarizations  correspond  exactly  to  the  two  values 
of  the  spin  projection  [a,  =  ±  1 .  If  p  the  momentum  of  the  photon  is  taken  along 
the  2-axis,  the  transversality  condition  excludes  the  state  Xq.  Hence  %^  and  x-\ 
are  the  allowed  spin  states  where  //  =  1  corresponds  to  right  circular  polarization 
and  fx  =  —  1  to  left  circular  polarization. 

Angular  momentum  eigenfunctions 

^^  fjui^^^)  ^^^  ^^^  simultaneous  eigenvectors  of  M^  and  M^ 

M^fjM{k,oc)^j{j+l)f^M{k,oc)  (45), 

MJ^Mi1^,oc)  =  Mf^Mik,oc).  (46) 

Since  the  spin  of  the  photon  is  equal  to  1 ,  the  orbital  angular  momentum  eigen- 
value I  for  a  given  j  can  only  be 

l=j     or     l=j±l{j^O).  (47) 

^  dO  indicates  differentials  of  the  angular  variables. 
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Hence  the  wave  functions  /y^^^  are  got  by  the  superposition  of  the  products  of 
orbital  and  spin  functions  99^  ,^  and  isn  subject  to  the  rule  for  the  projection 
values  of  m  and  n 

m+ILi  =  M.  (48) 

Hence  the  total  angular  momentum  wave  function 

fjiM{k,oc)=     Z    C{l,s,j;mfj,M)(pi^{k)Xs,Aoc)  (49) 

m  +  ,11  =  M 

Avhere  the  C's  are  the  usual  Clebsch-Gordon  coefficients.  For  a  given  value  of  / 
and ilf  these  coefficients  will  form  an  orthogonal  matrix.  We  can  decompose  /^^^v/ 
in  terms  of  the  spherical  basis  ;^^  and  obtain  the  components  of  vector  fju^j  as 

itiM  =  aC{l,  s,  j;  M-  fx,  ju,  M)  Y^^-^  =  a{k)  ^fjj^  (50) 

Avhere  '^^im  is  called  the  spherical  vector  function.  Since /^^j^  form  an  orthonormal 
set  expressed  by  the  condition 

jftlufyi'M'  d^^  =  ^ij'  ^W  ^MM'  (51) 

it  follows  that  the  spherical  vector  functions  are  an  orthonormal  set 

j%M  %'l'M'  do  =  djy  div  Smm'  .  (52) 

Parity  eigenfunctlons 

The  fjjif  functions  can  be  expressed  as  a  linear  combination  of  the  three  /y^^j 
functions  which  span  the  space  completely.  Ignoring  the  radial  functions  we  write 

fjM=   Z  Qi%iM'  (53) 

i  =  j-i 

Since  /^j^  should  satisfy  the  transversality  condition 

fm-n  =  o  (54) 

where  n  is  a  unit  vector  in  the  direction  of  momentum  there  are  only  two 
independent  photon  states  ff^  with  given  quantum  numbers  j  and  M  where  the 
allowed  values  of  A  are  A  =  0  and  A  =  +  1 .  These  are  the  two  parity  eigenfunctions 
for  given  j  and  M. 

To  obtain  these  //^'s  explicitly,  we  combine  the  three  '^jim  vectors  in  k  space 
to  give  three  mutually  perpendicular  spherical  vectors  *?)5  j| ^  which  is  longitudinal 
and  ^^j^  and  '^flf  which  are  transverse.  Remembering  that  the  unit  vector  in 
the  spherical  basis  has  components 

7^o  =  cos0;     n^i  =  ^ -—sind e^^'f"  (55) 

we  obtain  the  longitudinal  vector  ^^~m^  SiS  a  combination  of  ^jj^i,m  ^^d  '^j^j+i^m 
®5'M'  =  l/2y^m,-i.M-|/^vti_.^..,,,,.  (56) 
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The  other  two  desired  transverse  spherical  vectors  are  seen  to  be 

?),^°ir  =  ?)»M  (57) 

and 

i ^%  =  [M  X  ?)?if] or  Wli  =  |/2y^?)«>i,M  +  |/y-^ %.i-i.M.     (58) 

These  ^]m  ^^^  normalized  and  are  orthogonal  in  the  same  way  as  ^jim  and  we  have 
/?)ii&  Wm'  do  =  dij,  Smm'  Sw  (59) 

The  photon  eigenfunction  for  an  angular  momentum  eigen  state  with  given  / 
and  M  is  ... 

fiM^Qi^M  +  QoPrM  (60) 

where  ^^  and  Qq  are  arbitrary.  This  degeneracy  in  fj  ^  can  be  split  by  the  application 
of  the  parity  operator  P  defined  by 

pfik)  =  ±  m)  =-  ±f(-k)  (61) 

We  know  that  the  spherical  functions  F^^Cfe)  have  a  parity  ( — 1)^  and  hence  the 
spherical  vector  functions  %im   {n)  have  a  parity  ( — 1)^^^ 

P%iM{n)  =  {-iy^^%iM{n)  (62) 

For  transverse  spherical  vectors  we  have 

pWM^i-iy^M  (63) 

i'?)?M  =  (-l)'"^?)fi  (64) 

Hence  the  photon  states  can  be  given  by  the  '^jmM  functions  uniquely.  A=  1  is 
called  an  electric  state  and  A  =  0  is  called  a  magnetic  state.  It  can  be  seen  that  the 
parity  states  of  /  correspond  to  the  parity  of  the  H  field  of  the  radiation  which 
is  ( — 1)^  for  the  electric  state  and  ( — 1)^  +  ^  for  the  magnetic  state.  These  names 
"electric"  and  "magnetic"  states  correspond  to  the  fact  that  the  emission  of  the 
photon  in  these  states  arises  from  the  electric  and  magnetic  moment  respectively 
of  the  system  of  charges.  For  a  photon  with  a  given  energj^  value  \p  \  in  the  small 
range  d  |p  |,  the  radial  function  is 

a{k)  =  ce-^P^H[\k\  +  {\p\  +  d||>|)]^[|fe|  -(|p|  -  d\p\)]  (65) 

and  since  a  (k)  is  normalized  to  unity 

c==iiyp^dp/  (66) 

Hence  the  photon  state  characterized  uniquely  by  the  four  quantum  numbers 
corresponding  to  the  energy  p,  total  angular  momentum  j  and  its  projection  M 
and  parity  A  is  given  by  the  eigenfunction 

/^;MA  =  -j^--?)j3fe-''"//[fc  +  (p  +  dp)]ff[fc-(p-dp)].  (67) 


I 
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It  is  to  be  noted  that  when  A  =  1  the  spherical  vector  function  fj^M  i^  ^  linear 
combination  of  two  '^jim  functions  with  different  I  values  and  when  A  =  0,  the 
total  angular  momentum  is  equal  to  the  orbital  angular  momentum.  This  ex- 
plains the  well-known  statement  that  it  is  not  possible  to  divide  the  angular  momentum 
of  the  photon  into  an  orbital  and  spin  part. 

A  photon  which  has  total  angular  momentuni  value  ;  =-  0  has  only  one  spherical 
vector  function  corresponding  to  it  namely  ^o.i,o  which  is  according  to  the  equa- 
ions  (56)  a  longitudinal  spherical  vector.  A  spherically  symmetric  vector  field 
has  total  angular  momentum  equal  to  zero  and  can  be  represented  only  by  a 
longitudinal  photon  which  does  not  exist. 

Thus  the  wave  function  of  the  photon  in  an  arbitrary  state  /  can  be  expanded 
in  a  series  of  eigenfunctions  as 

f-    U  Cp^^^'-Ujm?.  (68) 

pi  MX 

and  the  expansion  coefficients  are  obtained  by 

C^>'^'=fft*iMAd'k.  (69) 

Expansion  o!  the  electric  and  magnetic  fields  in  terms  of  the  photon  eigenfunctions 

We  can  express  the  electric  and  magnetic  fields  E{x)  and  H{x)  in  terms  of  the 
eigenfunctions /^yjjf;i  of  the  photon  as 

Using  the  well-known  expansion  of  e*''"^  in  spherical  functions 

e-'-  =  2;9i{  kx)  )-f„(  J-)  r,„(^)  (72) 

we  obtain  integrals  of  the  type 

/?)i1i/(^)e''"'do  =  ?,(^*)?)i«(^)  (73) 

where  Qiikx)  are  related  to  Bessel  functions  Ji  +  ^i^{kx)  bj^ 

g,(lcx)  =  (2nriH'^l^^l.  (74) 

Expressing  E{x)  and  H{x)  as 

E{x)=^^{x)  +  ^*{x)  (75) 

H{x)  =  ^{x)--^^{x)  (76) 
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we  obtain  for  the  electric  state,  i.e.  A  =  1  the  following  values  of  ©(a?)  and  §  (a?) 
in  terms  of  the  ^jimix)  and  gi(kx)  functions. 


,j,M,+l 


fp,j,M,+l 


—  yp^  d  p 


M 


4:71  I  ^ 


[gj(v^)^im^-''''] 


(77) 
(78) 


For  the  magnetic  state  we  have  A  =  0 
i^/p^dp 


'■'P,j,M,0 


4:71  I  ^ 


9i{P^)'^jjM^~'^^ 


(79) 


L  r       /  T-  r       /  T  J  ^g^^ 

The  normalization  implies  that  there  is  a  single  photon  in  the  energy  range  p 
and  p  +  d^.  It  is  clear  that  the  §  field  in  electric  state  and  the  ®  field  in 
magnetic  state  are  represented  by  the  ^j^^  =  '^jjm  functions  which  are  trans- 
verse vectors  in  r  space  also.  Hence 

Iip,j,M,+i '  30  =  EpjM,o  •  a?  =  0.  (81) 

The  electric  field  in  the  electric  state  and  the  magnetic  field  in  the  magnetic 
state  can  be  split  into  transverse  and  longitudinal  vectors,  i.e.  ^Jj^^  functions 
and  f}^~M  functions  respectively 


>p,j,M,+l 


4:71 


^[^frr*^--^-)^'^--^ 


+ 


(2jTT'^^^^4jTT'^-^)^- 


ipt 


(82) 

^p,  j,  M,  0  =  *  ^p,  j,  M,+l'  (83) 

At  large  distances,  i.e.  for  ^o:  >  1  the  first  term,  i.e.  the  radial  component  vanishes 
since  Qj+i"^  Qj-i  and  the  field  becomes  in  the  radiation  zone, 

/  71  j\ 

cosU^a;  —  71 --1 


^p,j,M,+l  —        '^  ^p,j,M,0 


ii  +  1 


P'^PWlk 


Thus  it  is  possible  to  express  any  electromagnetic  field  in  the  form 

pjM^ 


e-»2".      (84) 


(85) 


Potentials  and  their  expansions  in  terms  of  the  eigenfunctions  of  angular  momentum 
and  parity 

Since  it  is  possible  to  specify  an  electromagnetic  field  by  the  vector  potential  A 
and  the  scalar  potential  A^  instead  of  the  electric  and  magnetic  fields,  we  shall 
conclude  this  chapter  by  expressing  A  and  ^4  in  terms  of  the  eigenfunctions 
of  parity  and  angular  momentum. 
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I 


For  a  given  electromagnetic  field  the  potentials  A  and  A^  are  defined  by  the 
equations 

E=-^-FA^;     H-curM.  (86) 

ot 


The  above  equation  (86)  can  be  written  in  terms  of  A{k)  and  A^{k)  the  Fourier 
transforms  of  ^(a?),  A^{x)  as  well  as  in  terms  of  E{k)  and  H{k)  as 


ith 


E{k)  =  —A{k)  —  ikA^{k) 
jff(fe)-  i{k  X  A{k)). 

splitting  A  into  transverse  and  longitudinal  parts 

A  =  B  -^n(p     ln=—] 

k'B{k)=-.0 
[the  transversality  condition  ri'  E  =  0  yields  the  relation 

iJcA^  +  9?  =  0. 
It  is  easy  to  see  by  equation  (3)  that  B  and  B  are  related  to  E  and  E  as 

Hence  B  and  B  can  be  put  in  terms  of  photon  wave  functions  /  and  /*  as 

—  i 


(87) 
(88) 

(89) 


(90) 


(91) 


B(k) 


B(fc)  = 


47r"'.  |/fc 


[/(fe)-/*(-fc)] 


l/fc 


(92) 


4n'l- 


[/(fe)+/*(-fc)] 


The  gauge  invariance  condition  according  to  which  E  and  H  do  not  change  if 

-^ —  is  added  to  ^4  and  VA  is  added  to  A  is  expressed  by  the  equation  already 

c  t 

given, 

ikA^  +  (^  =  0 

Similarly  the  subsidiary  Lorentz  condition  for  the  potentials 


div4  +  ^  =  0 
ot 


gives  another  relation  for  A^  and  99, 

ik(p  +  ^4=0. 

EPCE,      4 


(93) 


(94) 
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Note  that  equations  (90,  94)  are  just  like  the  equations  connecting  E  and  H. 
Hence  iiA^{k)  is  real,  we  can  introduce  complex  scalar  functions  /o(fe)  given  by 


1 
471^*  ]/k 


^4(fe)  =  TZi7r^7r[/oW  +  /o*(-'^)]    ' 


^■4(fe)=-^-?-[/o(fe)-/o*(-fe)] 


(95) 


and  /o  satisfies  the  Schrodinger  equation 

dt 
So  we  obtain  for  A  {k) 


i%  =  ^/«.  (96) 


where 

hik)^f{k)+info{k). 

The  expectation  value  of  the  photon  energy  can  be  written  as 

(x)=j1cPfdi^k=^JTc{h*h-f^U)d^k.  (98) 

Similarly  the  expectation  values  of  linear  momentum  and  angular  momentum 
are  given  by 

p:=jk{h*h-f*f,)d^k',     M^j{htMK-ftLk)d^k  (99) 

where  M  is  the  total  angular  momentum  operator  and  L  the  orbital  angular 
momentum  operator. 

If  we  want  to  obtain  the  potentials  corresponding  to  the  angular  momentum 
and  linear  momentum  eigenstates  of  the  photon,  we  write  down  the  angular 
momentum  and  linear  momentum  eigenfunctions  of  /q  and  h, 


and 


hv  =  c--j^e-^^^ H[k  +  {p  +  Ap)]H[k-  {p  -  Ap)-]  (100) 

K^  -  yj^  (e^  +  *  c  n)[H  [k-{-{p  +  Ap)]H[k-{p-A  p)] ,        (101) 


where  c  is  an  arbitrary  constant. 

For  the  functions  /q  the  angular  momentum  operator  is  only  the  orbital  momen- 
tum and  the  eigenfunctions  /^  for  angular  momentum  j  and  its  component  M  are 

foiM  =  Y^^YjMe-^^'Hlk  +{p  +  A  p)]H[k  -  ip-  A  p)  (102) 

and  the  parity  of  f^jj^^  =  ( —  1  y,  i.e.  that  of  an  electric  state.  Hence  for  the  magnetic 
state  /oy^y  vanishes,  i.e. 

kjMQ  =  0  and  h^MQ  =-"  fjMQ  (103) 
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For  the  electric  state 


KjM..i  =  J^~-  (Wm  +  icn  Yjm)  e-i^'H[k  +  {p  +  A  p)]  H\h  -  {p-Ap)] 


(104) 


The  choice  of  the  constant  c  determines  h  and  if  c  =  i  / =- ,  h  is  proportional 

to  the  spherical  function  Dy,;_i.M- 

(105) 
kp}M..i  =  ]Ij~Y  ]l^~Ap^^^'^~'^'^^^  -^p  +  Ap]H[k-  ip-Ap)].       (106) 

From  these  eigenfunctions  h  and  /q,  we  can  find  the  potentials  A  and  ^q  if 
we  set 

^(r)  =  %(r)+^*(r)  (107) 

^4(^)  =  ^4W  +  3lJ(r).  (108) 

P    We  determine  the  potentials  for  the  plane  waves : 


QHP-^--PiO 


(109) 


(110) 


From  the  spherical  wave  solution,  we  obtain  for  %pjM  ^^^  ^4pjM  ^^^  following 
expressions  if  we  set  c  =  —  y- —  : 


For  the  electric  state 


«— •S^I/ttt^-?^— ^'" 


and  for  the  magnetic  state 


for  arbitrary  c. 


^AvJMO  =  0 


(111) 

(112) 

(113) 
(114) 


CHAPTER  II 

QUANTUM  ELECTRODYNAMICS 

1.  PERTURBATION  EXPANSIONS  AND  KERNEL  FUNCTIONS^ 

We  shall  now  examine  how  time  dependence  other  than  that  of  the  ex- 
ponential type  enters  into  the  wave  function.  Let  us  consider  a  particle  in  the 
stationary  state  corresponding  to  a  time  independent  Hamiltonian  Hq  and  let 
the  eigenfunctions  of  Hq  be  99^  (a^)  e"  *^»*  where  E^  represents  the  eigenvalue  of  ^^ . 
By  time  independence  we  also  imply  that  Hq  exists  for  all  time,  i.e.  from  ^  =  — 00 
to  ^  =  +00.  If  a  particle  is  found  in  any  one  of  these  eigenstates,  it  will  continue 
to  do  so  for  all  time.  Time  dependence  other  than  that  of  the  exponential  type 
«can  occur  if  the  wave  function  ip  satisfies  the  Schrodinger  equation 

i^^:=H(t)f(t)  .  (1) 

where  H  is  time  dependent.  In  any  physical  process  we  deal  with  the  Hamiltonian 
JH{t)  such  that  H  -^  Hq  as  t  ^>-  +  00  and  t  ^>'  —  00.  This  implies  that  if  we  can 
write 

H  =  H,+  H'  (2) 

"where  H'  is  called  the  perturbing  Hamiltonian,  ^'  ^-  0  as  ^  ->  +  00  and  t  ->  —  00. 
But  if  we  wish  to  insist  that  W  should  be  in  operation  for  an  infinite  period  of 
time  the  above  distinction  between  H'  and  H  can  still  be  kept,  provided  we  set 

i/'  =  0         for  ^  <  Tq 

H'  =  H'{t)  for  TQ<t<T  (3) 

H'  =  0         for  t>T 

:and  let  T  and  Tq  tend  to  +  00  and  —  00  respectively.  Even  if  H'  does  not  vary 
with  time  in  the  interval  Tq  <C  t  <C  T  it  is  still  in  a  strict  sense  time  dependent 
«ince  we  have  assumed  H'  to  vanish  outside  this  interval.  We  shall  obtain  the 
solution  of  (1)  quite  generally  by  expressing  the  wave  function  at  2^  in  terms  of 
the  wave  function  at  Tq  and  later  discuss  the  limiting  process  as  T  and  Tq  tend 
to   +00  and  — 00  respectively. 


1  R.  P.  Feynman,  Phys.  Rev.  76,  749.  See  also  for  example,  the  semi-expository  paper 
Tjy  Alladi  Ramakrishnan  and  R.  Vasudevan  on  "Perturbation  expansions  and  Kernel 
functions  associated  with  single  particle  wave  function",  Proceedings  of  the  Summer  School 
of  Theoretical  Physics,  Mussoorie,  India  (1959), 
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If  we  assume  that  (p^  's  form  a  complete  orthonormal  set  it  is  clear  that  \p  {x\ 
can  be  expanded  in  terms  of  99^'s  as 

n 

=  2  Kit)  Wn(^)  =  Z  cM  cPni^)'-  (5> 

n  n 

We  note  that  c^^(t)  and  h,i(t)  satisfy  the  following  equations 

^^^^=ZH'nmCr>^{t)+EnCn{t)       •  (6> 

0.1  ffi 

i^ML^2;H'„„{t}b,,{t)  (7) 

at  rn 

Km^jcpt{x)H'cp^(x)di^x  (8) 

H'nmit)  =jxpt  W  H'xp^ix)  d^X  .  (9) 


where 

and 

This  implies  that 


smce 


TI'       (f\  _    Zf'        p.i{E^-Em)t 

=  e^HotH^^e-iH,t  (10) 

yj^{x)  -  e-^Hof(p^(x)  -  e-^^-Vm(«)-  (H) 

The  equation  for  c{t)  can  be  put  in  the  vector  matrix  form  as 

.  dc{t) 


dt 


[H^]c{t)  +  [Ho]c{t)  (12) 


where  c  (t)  is  a  column  vector  with  typical  elements  c^  {t),  [H']  and  [Hq]  are  matrices 
with  elements  H'nm  ^^d  E^^d^n  respectively.  We  write  the  vector  matrix  equation 
for  h(t)  below  as  we  are  going  to  discuss  it  in  detail. 

.dh{t) 


dt 


[H'{t)]h(t)  (13) 


\ 


where  {H' (t)]  is  the  matrix  with  typical  elements  H'^^{t).  If  we  work  with  i(^)'s 
instead  of  c(^)'s  we  need  only  the  matrix  representation  of  the  perturbing  Hamli- 
tonian  H'  remembering  of  course  the  relation  between  H.'^^{t)  and^^^.  When 
discussing  the  field  theoretic  formalism  we  wiU  be  able  to  identify  c(0  ^^d  h{t)  as 
the  two  representations  of  the  same  states  called  the  Schrodinger  and  the  inter- 
action representations.  We  shall  not  go  into  this  nomenclature  any  further  now, 
but  proceed  to  discuss  the  equation  for  b{t). 

The  calculation  of  the  coefficients  b^it)  constitutes  the  main  problem  in  any 
quantum  mechanical  collision  process.  There  are  at  present  two  main  methods 
of  calculation : 


^  W.  Heitler,  Quantum  Theory  of  Radiation,  p.  139,   Oxford  University  Press,  London 
(1954). 
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(i)  The  old  fashioned  perturbation  theory  relating  to  single  particles, 

(ii)  The  kernel  function  formalism  of  Feynman. 

We  are  concerned  only  with  wave  functions  of  a  single  particle  with  6„(^) 
representing  the  amplitude  for  the  particle  to  be  found  in  a  state  n.  It  is  to  be 
noted  that  only  transitions  between  positive  energy  states  are  envisaged  here. 
Later  we  will  be  using  the  same  equation  with  a  generalized  interpretation  for  the 
Hamiltonian  so  as  to  include  systems  of  many  particles.  This  corresponds  to 
including  the  negative  energy  states  in  the  kernel  function  formalism  and  its 
consequent  modification  by  Feynman^. 

By  a  solution  of  the  equation  (13),  we  mean  that  given  h  (Iq)  we  have  to  determine 
6  (t)  for  ^  >  ^0 .  From  elementary  iteration  we  obtain 

Ht)=[U{t,Q]h[Q  (14) 

where  U{t,  Iq)  is  the  expansion 

t  t  T^      . 

U{t^.to)  =  l  +  {- i)JH'{T^)  dTi  +  (-  i)2/dT2/dTi H' (r^)  H' (tJ  +  •  •  • 

...  +  (-  *)^/dT;fc/dT,_i  '••fdr.H'  (T,)   .  .  .  ^'  (Ti)  +  •  .  .  (15) 

Iq  If)  Jo 

The  physically  interesting  case  is  obtained  by  setting  t  ^  T  and  ^q  =  ^c  ^^^^ 
the  times  corresponding  to  "switch  off"  and  "switch  on"  of  the  perturbation  and 
letting  T  and  T^  tend  to  +oo  and  — oo  respectively.  In  the  above  integration 
we  have  ordered  the  time  parameters  in  the  following  way 

In  a  physical  problem  since  we  are  interested  in  computing  the  amplitude  of 
transition  from  a  definite  intial  state  to  a  definite  final  state,  we  assume  that  all 
the  elements  of  6(^o)  ^^^  ^^^^  except  a  specified  h^(t^  which  is  set  equal  to  unity. 
Denoting  hj{t)  the  typical  element  oih(t)  for  obvious  reasons  as  hfi(t)  we  can  write 

b,S)=(.f\8\iy=Z<f\S^''^i>  (16) 

n 

with 

8=     lim     U{t,tQ) 

t  =T  ->+oo 
<o  =  To->-oo 

and 

^(-)  =  (-irfdr^f  dT,_i  .  ..fdr.H'M  ^'(r«-i)  •  •  •  H'{t,)  (17) 

where  for  the  present  we  have  adopted  the  notation  that  |i>  is  the  vector  b 
with' all  its  elements  zero  except  b^  which  is  set  equal  to  unity  and  |/>  is  a 
vector   with  all  its  elements  zero  except  bf  which  is  set  equal  to  unity.  By 


W.  Heetler,  Quantum  Theory  of  Radiation,  Oxford  University  Press,  London  (1954). 
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definition  each  element  of  [H'  (t)]  is  an  integral  over  space  and  therefore  we  define 

Kmi^,  t)  -  Wn*  {X)H'  {x)  xpm{x)  (18) 

SO  that 

H'nm{t)=lRnm{x)d^X.  (19) 

Thus  we  write  the  expansion  as 


<f\S\i}  =  b{t) 


l  +  {-i)  f  d^x,f[E{x,)]dr,+ 


+  (-*)*  fd^xj,  Jd^Xk_-i"'  j  d^x^j  dTj,j  dr^^^-"  j  dr, 

-oo  -oo  -oo  ^0  to  to 


[R{x,)][R{x,_,)]  .  •  .  [R{x,)]b  it,)  + 


6^(g==.0     if     h4=i,  bf,{t)=0     if     k^f. 


(20) 


The   difference   between  the  old-fashioned  perturbation  theory  and  the  kernel 

»  function  formalism  is  just  the  following:  In  the  former  we  perform  the  space 
integration  first  and  the  time  integration  subsequently  while  in  the  four  dimen- 
sional formalism,  the  integration  is  performed  over  all  the  four  variables  simul- 
taneously. 


Old  fashioned  perturbation  theor>^ 

We  shall  obtain  the  probability  per  unit  time  that  a  particle  makes  a  transition 
from  the  state  i  to  the  state  /  under  a  perturbation  H'.  Taking  the  first  order 
term  in  the  expansion  we  have 

t  oo 

</|>Sf(l)|^>=  lim   {-i)ldT,fd^x,y^f{x,)H'{x,)yji{x,).  (21) 


t  ->  +  oo 

Assuming  ipf  and  y)^  are  free  particle  eigenstates  of  energy  we  can  write  them  as 

y}f  =  (pf  e~''^p/^     ipi  =  (fi  e~^^Pi^  (22) 

where  p^  and  pf  are  initial  and  final  momenta  of  the  particle.  Let  us  also  assume 
that  H' [x)  can  be  written  as  W {x)e^'*'^*^,  i.e.  using  — iq^t  in  the  exponent  when 
the  interaction  imparts  an  energy  q^  and  +iq^t  when  it  takes  away  an  energy  q^ 
from  the  particle.  Thus  we  write 

t 

bfi  =  Hjii-  i)  f  G-'^Pf^  Q±iQjQiE^it  ^t  (23) 

where 

HU=j(pf{x)H'{x)(pi{x)d^x  (24) 
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i.e.  we  are  performing  the  space  integration  first  and  subsequently  the  time 
integration.  For  performing  the  space  integration,  we  first  notice  that  the  space 
dependence  of  9?^  and  99^  occur  only  through  the  factors  e^'*<^and  e***'*  respectively. 
Let  us  assume  therefore  that 

fp.  =  e+iPi-x^     JS'[x)^e^^9-'';     cpf  =  Q^^Pf"^ ,  (25) 

i.e. 

'ip.^Q-iPi-x^     H'{x)  =  Q-^^'^\     ipy  =  e-^Pf^,  (26) 

ignoring  the  normalization  factors  which  are  not  relevant  for  the  present  dis- 
cussion. The  fact  that  we  take  H'{x)  =  e~**'^  implies  the  absorption  of  four  momen- 
tum q  with  q^  the  energy  positive  (i.e.  we  are  considering  a  first  order  electrodynamic 
interaction  of  an  incident  photon  and  an  electron).  If  on  the  other  hand,  we  are 
considering  a  first  order  emission  (i.e.  when  there  is  only  an  incoming  electron) 
we  write  H'{x)  =  e**^.  Considering  the  case  of  absorption  and  integrating  with 
respect  to  space  in  Hj^  we  obtain  as  a  factor  the  ^-function  {27z)^d{pf  — p^  —  q) 
and  with  respect  to  time  we  obtain  the  factor  2nd(Ep^  —  Eq  —  Ep.).  This  implies 
that  the  matrix  element  will  vanish  unless 

Pf-Pi+q    and    E^^  =  Eq  +  E^^.  (27) 

It  is  easy  to  see  that  the  second  condition,  i.e.  the  energy  conservation,  cannot 
be  satisfied  if  we  assume  the  first.  In  other  words  if  we  add  the  two  four  momenta 
p.  and  q  we  obtain  a  four  momentum  which  does  not  satisfy  the  energy  momentum 
relation.  Thus  there  cannot  be  a  first  order  process  in  which  an  electron  can  absorb 
a  photon.  In  a  similar  manner  we  find  that  there  cannot  be  a  first  order  emission 
of  a  photon  also. 

However  if  the  time  component  q^  of  q  or  more  generally  oiH'{t)  is  zero,  the 
energy  momentum  relation  for  Pi  +  q  still  holds.  An  electromagnetic  potential 
which  is  only  a  function  of  space  and  is  independent  of  time  can  be  written  as 

A{x)=Ja(q)d(q^)e-'^^-'^d^q 

=fa{q)e^9l''d^q.  (28) 

This  implies  that  a  potential  is  a  superposition  of  time  indpendent  waves  with 
q^=  0  and  in  the  interaction  the  particle  can  acquire  a  momentum  q  with  an 
amplitude  a  (q).  Given  p^  and  p^  we  can  choose  a  q  such  that  pf  — p^  =  q  (with 
^4  =  0)  i.e.  a  first  order  scattering  in  an  electromagnetic  potential  is  possible. 
Taking  ip^  and  ipf  as  given  in  equation  (26)  and  H^  SiS  A{x)  the  matrix  element 
for  the  first  order  process  will  be  proportional  to  a{q). 

At  this  stage  it  is  necessary  to  explain  the  meaning  of  the  initial  and  final 
states  in  the  old-fashioned  perturbation  theory.  In  the  case  of  absorption,  the 
initial  system  consists  of  an  electron  and  a  photon  and  the  initial  state  i  refers 
to  this  system.  The  final  system  is  just  the  electron  and  the  final  state  refers  to  it. 
The  energy  of  the  initial  system  is  therefore  given  by  Ei=  Eq  -{-  Ep.  and  the  energy 
of  the  final  system  is  Ef  =  Ep.  If  on  the  other  hand,  there  is  an  emission,  the 
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initial  system  consists  of  just  an  electron  and  the  final  system  of  an  electron 
and  a  photon.  In  such  a  case  we  have 

Ei=E^^    and    Ef  =  E^^A-E^.  (29) 

In  the  above  discussion  about  energy-momentum  conservation  we  have  re- 
presented the  eigenstates  of  energy  by  simple  wave  functions  of  the  tjrpe  e~*^'^. 
The  arguments  are  valid  where  the  wave  function  of  a  particle  has  more  than 
one  component  (as  in  the  case  of  spinors)  since  for  our  argument  it  suffices  that 
any  eigenfunction  of  energy  must  be  written  in  the  form 

y>i(t)-=<fie-*^p',  (30) 

i.e.  the  only  dependence  on  time  is  through  e"*^^^  It  will  be  convenient  to  write 
the  first  order  matrix  element  as 

<f\8^^^iy==-27tid{Ef-E,)Hfi  (31) 

In  a  wave  function  formaUsm  dealing  with  the  interaction  of  an  electron  with 
an  electromagnetic  field  the  perturbation  term  itself  contains  the  wave  function 
of  the  photon  (or  the  electromagnetic  field)  which  is  part  of  the  system  before  the 
interaction  if  the  perturbation  represents  an  absorption,  and  is  part  of  the  system 
after  the  interaction  when  it  represents  an  emission.  If  we  use  the  notation  j  iy 
for  the  initial  system  it  cannot  be  strictly  identified  with  6(^o)  with  all  except 
its  ^th  element  zero,  for  the  latter  represents  the  state  of  the  electron.  This, 
difficulty  will  not  arise  as  we  shall  see  in  a  field  theoretical  formalism  where  the 
state  at  t  refers  to  the  entire  system  at  t  and  the  interaction  represents  operators- 
which  cause  transitions  in  these  states.  In  the  Feynman  formalism,  the  states 
always  refer  to  the  electron  and  the  inequality  of  the  energy  between  the  states 
before  and  after  the  interaction  is  imbedded  in  the  concept  of  the  virtual  state 
of  the  electron. 

Thus  for  the  sake  of  uniformity  of  notation  with  the  field  theoretical  formalism 
we  shall  write  <f\S\iy  for  bfi  and  <f\H'\iy  for  Hj^.  Thus 

<f\S<^^\iy  ^-<f\H'\iy2nid[Ef-E,).  (32) 

When  the  time  component  of  ^'(0  is  not  zero,  the  first  order  interaction 
vanishes  and  we  take  the  second  order  interaction 

<  Tg  OO  OO 

</|  >S(2)  \iy  =  {-i?S  j  dT2  /  dTi  /  diH^  I  d^x,  xpf  [x^)  H'  {x^)  y^^{x^) 

^  to  h  -oo  -~ 

wli{^i)H'{x^)Wi{^i) 
=  {-i)^2<f\H'\my<m\H'\iy  (33) 

m 

as  J dr/f  e-^^^rn-Ef)r..-i{E,-E^)r,,  dr^^ 

to  to 

tn^-  —  oo,      ^  ->  +  oo 
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the  summation  over  m  implying  summation  (or  integration)  over  all  possible 
intermediate  states. 

In  the  case  of  the  first  order  matrix  element  we  have  made  use  of  the  lemma 

t 
Hm      fe^i^f-^i^^dT^27td{Ef-Ei).  (34) 

t    ■*-  +  CXi   f 

In  the  second  order  matrix  element  we  find  that  we  have  first  to  integrate  with 
respect  to  the  intermediate  variable  t^  between  the  limits  Tg  and  tg  and  let  only  t^ 
tend  to  — oo.  To  evaluate  this  integral  we  adopt  the  following  device.  We  multiply 
the  integrand  by  e^"^^  where  £  is  a  positive  infinitesimal  quantity  which  can 
be  made  as  small  as  we  please,  i.e.  £  ->  0.  Keeping  e  non-zero  and  performing 
the  integration  we  evaluate  the  integral  as 

Hm  — — - — — -.  (35) 

,^0    -i{Ei-E^  +  i8) 

The  second  integration  over  Tg  yields  the  delta  function  d(Ef  —  Ej)  i.e.  /  should 
be  on  the  energy  shell.  It  is  not  necessary  however  that  E^  should  be  equal  to  E^ , 
i.e.  the  summation  over  E^  will  include  values  both  off  and  on  the  energy  shell 
E  ^  E^.  The  separation  into  these  two  kinds  of  intermediate  states  will  by  made 
presently.  Writing y;^ (a^^)  and ipfix^)  as  e~*^»'^»  and  e"*^/'*^  respectively  and  taking 
H^{xi)  =-  e~**^"^s  H'(x2)^  qU^-x^  corresponding  to  the  absorption  and  emission 
of  a  photon,  we  have 

i  T„  oo  oo 

</|;S(2)|i>  =  (-*)2  2'/dT2/dTi  j d?x^  Jd^x^e'Pf'^^e'^^-^'- 

^    t„  ta  -oo  -oo 

Q-iPm-Xi  QiPm-Xi  Q-iqi-Xi  ^-iPi-x^  (3g) 

as  ^_>-L-oo^o^-  —  oo 

where  each  of  the  four  vectors  p  and  q  satisfy  the  correct  energy  momentum 
relation.  The  space  integration  leads  to  the  conservation  of  momentum  (i.e.) 

Pm  =  Pi+  qi,  Pf  +  q2  =  Pm  (37) 

As  mentioned  before,  the  initial  system  in  this  case  consists  of  an  electron  of 
four  momentum  p^  and  a  photon  of  four  momentum  g^,  (i.e.)  the  energy  of  the 
initial  system  is  given  by 

Ei  =  E^,  +  E,^.  (38) 

The  intermediate  state  is  that  of  a  single  electron  with  momentum  p^  +  ^i  and 
energy  ^^  =  E^.^q^.  The  final  state  is  that  of  an  electron  of  four  momentum  ^^ 
and  a  photon  of  four  momentum  q2  and  has  energy  Ef  =  E^^  +  Eq^.  While 
Ef  =^  E^  the  energy  E^  of  the  intermediate  state  is  not  equal  to  J^^  or  Ef,  i.e. 
energy  is  not  conserved  in  the  intermediate  state  though  momentum  is  conserved. 
In  any  physical  process,  when  we  consider  the  absorption  of  a  real  photon  of 
four  momentum  q^ ,  it  is  obvious  that  this  should  form  part  of  the  initial  system 
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tand  therefore  g^  and  p^  are  given.  Hence  for  a  required  pf  we  find  gg  is  determined 
and  hence  the  summation  over  the  intermediate  state  impHes  no  summation  over 
values  of  g^  or  q^ .  However  since  we  are  interested  only  in  the  emission  of  a  photon  q^ 
jBind  the  absorption  of  a  photon  g^  but  not  in  their  relative  sequence,  we  could 
|iave  taken  the  emission  of  the  photon  q^  at  Xj^  and  the  absorption  of  the  photon  q^ 
at  X2 .  In  this  case  the  initial  system  is  an  electron  of  four  momentum  p^  and 
a  photon  of  four  momentum  q^ .  The  intermediate  state  refers  to  three  particles 
viz.,  the  electron  of  momentum  p^  —  q^  and  energy  Ep'^=  -^{Pi-Qt)  ^  photon  of 
four  momentum  q^  and  a  photon  of  four  momentum  ^2  •  The  final  state  consists 
of  an  electron  of  four  momentum  pf  and  a  photon  of  four  momentum  gg  •  I^i  this 
case  we  have  the  contribution 


lim   i-i)^  2; Hf^Hlif  dr^f  dT^eii^f-^i^'^-eM^i~J^rnnr,-'^'>  (39) 

with 

Ef^Ep^+E,,^',  E,  =  Ep^  +  E,^  (40) 

</  \8^^^  \iy  gets  contribution  from  both  these  cases  and  hence  we  write  the  matrix 
element  as  the  sum  over  all  possible  intermediate  states. 

Thus  we  have  here  only  two  possible  intermediate  states  corresponding  to 
the  two  possible  sequences  in  which  the  absorption  and  emission  can  occur. 
More  generally  the  summation  should  be  interpreted  as  including  summations 
or  integrations  over  various  parameters  characterizing  the  intermediate  states. 

The  above  considerations  can  be  generalized  in  exactly  the  same  manner  to 
the  mth  order  process.  We  write  the  matrix  element  in  such  a  case  as 


r      <f\S(''^iy=-27iid{Ef-Ei) 

f                          ^             </  l^'l  m^Xmi  l^'l  m2>  •  •  •  <m.,_,  \H'\iy 

(41) 

^  {Ei  -  E,^^  +  i  £)  {E,  -Em,  +  ie)...{Ei-  E^^_^  +  i  s) 

Defining  </  ]  T^  | *>  throu gh 

</  IS^""^  \i>  =  -  2^id{Ef  -  Ei)  if  |T('*)  |i> 
and  writing 

we  note  that 

S=  l—27tid{Ef  —  Ei)T 

(42) 
(43) 

f                                                           ^  -  1  +  T 
where 

Bn  the  definition  of 

1                                         X/IS^Ii)      or      <i\T'-">\iy 

(44) 
(45) 

we  have  stated  that  the  intermediate  states  have  to  be  summed  over  states  both 
off  and  on  the  energy  shell,  i.e.  it  is  implied  that  if  we  take  any  typical  factor 
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Ei  —  Ejn  +  ^  £  in  the  denominator  and  replace  E^  —  i  e  by  a  complex  variable, 
we  are  integrating  over  the  real  axis  passing  through  the  singularity  at  Ei 
(when  e  ->  0).  This  is  equivalent  to  writing 

-F 1    -4-;p  ^  F   ^  F     -^^^{Ei-EJ.  (46) 

In  the  first  part  the  integration  over  m  has  to  be  understood  in  the  sense  of 
Cauchy-principle  value,  i.e.  omitting  the  contribution  from  the  immediate  neigh- 
bourhood oiEi.  The  second  part  represents  the  contribution  from  the  integration 
along  a  small  semi- circle  round  the  singularity  E^  when  the  radius  tends  to  zero. 
In  other  words,  in  the  first  part  the  energy  E^  is  off  the  energy  shell  while  in  the 
second  term  it  is  on  the  shell.  We  give  here  a  very  simple  argument  by  which 
a  relation  can  be  established  between  <^f\T\iy  and  the  part  </  |iL  |*>  which  does 
not  involve  an  intermediate  state  on  the  energy  shell. 

The  matrix  element  </|T  |*>  must  be  a  sum  of  two  mutually  exclusive  parts: 
one  involving  no  intermediate  state  on  the  energy  shell,  the  other  involving  at 
least  one  intermediate  state  on  the  energy  shell.  The  first  is  by  definition  </  |^  |^>. 
In  computing  the  second  let  |/^>  be  the  "last"  among  the  intermediate  states  on 
the  energy  shell  in  following  the  sequence  from  |^>  to  |/>  i.e.  the  state  on  the  energy 
shell  next  to  the  final  state  itself.  We  can  speak  of  such  a  sequence  even  in  the 
strictest  sense  before  performing  the  time  integration.  The  transition  from  |*> 
to  |4>  proceeds  through  T  while  that  from  |/^>  to  |/>  proceeds  through  iT.  Thus 
we  can  immediately  write 

if\T\iy  =  if\K\iy-i7zjif\K\U>it^\T\iy      '  (47) 

the  factor  in  occuring  due  to  integration  over  the  ^-function  part  (since  |/^>  is 
on  the  energy  shell)  of  \l{Ei  —  Ef^  +  i  e).  The  above  relation  can  also  be  obtained 
by  a  systematic  substitution  for  each  \l{Ei  —  E^  +  *  e)  in  terms  of  the  off  and  on 
the  energy  shell  components  and  suitable  rearrangements. 

We  shall  now  obtain  a  similar  integral  equation  for  <m  \T\iy  where  |m>  (unlike 
|/>)  is  not  necessarily  on  the  energy  shell  E^ .  The  above  equation  in  principle  is  valid 
when  we  replace  |/>  by  |m>  but  as  it  is  customary  to  define  the  iT-matrix  connect- 
ing states  on  the  energy  shell,  we  shall  obtain  <m  1 7^  |  i>  in  terms  of  <Z  |  ^  |  A;>  where  |  Z> 
and  \ky  are  not  necessarily  on  the  energy  shell.  We  use  the  following  argument. 

Either  (i)  a  transition  occurs  directly  from  \iy  to  |m>  and  the  ampHtude  for 
this  is  <m|^|*>  or  (ii)  there  is  at  least  one  intermediate  state.  Let  the  "last" 
intermediate  state  (next  to  m)  be  |Z>.  The  transition. from  |Z>  to  |  m>  is  through  W 
and  from  |*>  to  \ly  through  T.  Thus  we  have 

im\T\iy  =  im\H'\iyVZ  ^_\^_^-^<^r^\H'\ly<l\T\iy 


=  <jn\H' 


<m\H'\ly<l\T\iy 

I  Ei  —  El 


i7zjim\H^\t'y<r\T\iy  (48) 
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^fcwhere  if  the  "last"  intermediate  state  is  on  the  energy  shell  we  denote  it  by 

^l]/'>  or  else  by  |Z>. 

The  connection  between  matrix  elements  of  the  type  <,f\T  |i>  where  /  is  on  the 
energy  shell  Ei  and  <m  |^  |*>  where  |m>  is  not  necessarily  on  the  shell ^^  can  also 
be  seen  by  a  mere  examination  of  the  expansion  </  \T  |i>.  Taking  the  expression 
for  </ 1^^*^^  |^>  we  can  write  it  as 

►  </|r(i)|i>  =  </|fl'|i>  (50) 

«r 
</i^K>^</iHi.>+^<^fiy-ifi^>         ,51) 

==<f\H'\yj!t'>  m 

T\iy  =  Hy)\-'^  (54) 

T\iy  =  m\iy  +  m--l--^J\iy  (55) 


where  we  define 

We  thus  have 
and  recalling  that 

we  can  write 


We  can  also  establish  the  connection  between  the  matrix  elements  <m|T|*> 
and  </|T|*>  in  another  manner  by  just  observing  that  we  can  also  write 

with  </l^M^>  =  </l^'H>  (^^) 

or 


where 


I.e. 


=  <^}->|^'|i>  (60) 


Ef-E^  +  ie 


m    Jbjf  —  Jijj^  —  ie 


=  l/>  +  f7^„-^^'l^^">-  <«'» 
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We  shall  now  examine  the  meaning  of  ^^^^  and  y^^~^  solutions  in  greater  detail. 
This  we  can  do  by  remembering  that  the  matrix  element  </  |  iS  |  *>  on  the  energy  shell 
was  computed  by  first  obtaining  bfi{t,  Iq)  as  an  infinite  series  expansion  for  finite 
t  and  ^0  and  passing  to  the  limit  as  ^  ->  oo  and  tQ->  —  oo.  For  applying  this  limiting 
process  it  is  only  natural  to  assume  t  >  t^  and  obtain  bfi{t,  to)  as  the  element  of 
the  vector  b{t)  satisfying  (13)  with  the  "initial"  condition  that  5 (^q)  is  a  vector 
with  all  elements  zero  except  the  i-th  which  is  unity  and  represents  the  amplitude 
for  the  system  being  in  |*>-  But  we  know  that  the  solution  for  b{t)  is  meaningful 
even  for  ^  <  ^^  or  in  other  words  writing  b{t)=  U  {t,  Iq)  b  (Iq)  the  transformation 
matrix  U  {t,  to)  exists  for  ^  ^  ^^  as  has  been  elaborated  earlier. 

It  is  found  convenient  to  divide  the  solution  for  b{t)  into  two  classes. 

b^  (t)  given  b {to)  when  t  y  tQ,tQ^  —  oo 
b~  (0  given  b  {to)  when  ^  <  ^o»  ^o  "^  +  °°  • 
We  now  write 


<f\S\i}=   Urn    if\U{t,to)\i>  (63) 

t  -^  +  oo 

Urn  <f\U{t,t,)U{t„to)\iy 

t   -^  +  OQ 

t„-^-oo 

Hm  iU{tl,t)f\U{t^to)\iy 

t  ->-  +  (» 

=^Uf'r{t)h^^\t)  (64) 

where  we  have  used  the  lemma 

C7t(^,,0=  U-m^,t)^  U{t,t,)  (65) 

noting  that  in  the  above  definition  of  both  b^^^  and  b^~\  t  is  kept  finite  and  this 
has  the  consequence  that  the  elements  of  b\^^  {t)  are  non-zero  whether  they  are  off 
or  on  the  energy  shell -E^^  (similarly  for  bj~^{t)  on  the  shell  £^/).  When  t  is  finite  in 
the  integral  expansion  for  bi  {t)  the  last  integration  is  performed  from  —  oo  to  ^ 

hence  yielding  the  factor  ~ = ^~  .   1%^  we  make  ^  ->  oo  we  should  replace 

it  by  d(E^—Ei).  If  we  set  ^  =  0  we  note  that  b^i!'\0)  is  identical  with  the  xp^^^ 
we  have  defined  before.  In  a  similar  manner  we  identify  6/~^(0)  with  yff"*.  Thus 
if\S\iy  is  also  expressed  in  the  form 

</i>S|^>  =  <^<->|^H>.  (66) 

We  have  now  to  convince  ourselves  that  the  above  equation  is  consistent 
with  (52).  That  this  is  so,  follows  directly  from  the  equation  (51)  for  \p[^\  For  if 
we  substitute  for  ^J.+>  in  </  \H'  |^^^^>  successively  by  the  expression  (56)  we  obtain 
the  above. 


r 
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Kernel  function  formalism 

We  shall  now  write  the  integral  expansion  form  of  </|/S|i>  using  the  kernel 
function  formalism. 

In  the  expression  for  ^f\8\iy  given  above  we  replace  i?,^,^  by 

and  a  tjrpical  term  of  the  expansion  (say  the  (w  +  1)  -th)  becomes 

+  00  00  /  Tg 

lim    (-^^   /■  di^x^..  .   f  dH^  fdr^..  .  f  dr^ 

<o->-oo 

■■•Wl  {Xi  Tj )  H'  (ti)  ipi  (a?!  Ti)  (67) 

where  t^  are  ordered  from  ^^  to  ^.  If  we  define  the  function 

K{2,l)  =  2y>m{2)wla)  (68) 

m 

where  (1)  and  (2)  refer  to  the  space  time  points  [xi,  t^)  and  (a?2,  ^2),  the  above 
expression  is  written  as 

hm  {-if  f  dT„  . .  .  / dTi  / d^x,, .  .  .  J d^x^^ 

f^  +  °°  to  to  -00  -00 

to^-00 
xpf{n)H'{n)K{n,  n-\)H'{n-\)...H'(2)K{2,  I)  H' {I)  xpi{\) .         (69) 

K{2,  1)  as  defined  in  (68)  is  valid  for  all  points  and  satisfies  the  homogeneous 
equation 

^    i^K{2,  l)-H{2)K{2:i)^0  (70) 

but  in  the  expansion,  since  time  is  ordered  K{2,  1)  occurs  with  ^2  >  ^i-  We  can 
remove  the  restriction  on  time  ordering  if  we  define  a  kernel  Kq{2,  1)  which  is 
such  that  Kq{2,  1)  =  K{2,  1)  for  t^  >  t^  and  Kq{2,  1)  =  0  if  ^2  <  ^  •  ^0  then 
satisfies  the  inhomogenous  equation 

i4r  Koi^,  l)-H{2)Ko{2,  1)  =  ^(2,  1).  (71) 

(7^2 

But  there  is  nothing  in  the  Schrodinger  equation  itself  to  imply  that  its  solution 
should  not  be  valid  for  t^<^t^.  If  in  our  formulation  we  take  care  to  order  our 
time  in  the  integral,  it  is  immaterial  whether  we  take  K{2,  1)  or  Kq{2,  1).  The 
physical  interpretation  of  the  terms  in  the  expansion  can  be  had  if  we  imagine 
that  the  perturbation  acts  at  space -time  points  1,  2,  .  .  .,  and  the  "motion"  of 
the  particle  is  represented  by  a  propagator  in  between.  These  four- dimensional 
"vertex"  points  at  which  perturbations  act  are  integrated  out.  Thus  K^  is  the 


64  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

same  as  Feynman's  kernel,  the  important  difference  being  that  our  kernel  K 
obeys  the  homogenous  equation. 

This  formalism  of  perturbation  theory  is  called  non-relativistic  though  all  the 
expressions  used  here  are  four  dimensional.  This  is  perhaps  due  to  the  fact  that 
the  \p  functions  used  here  are  solutions  of  the  non-relativistic  equations.  We 
shall  be  discussing  the  kernel  function  formalism  for  the  Dirac  equation  in  the 
next  section. 

We  have  till  now  given  the  prescription  for  the  modes  of  calculation  of  </  |aS  |i> 
and  hence  the  probability  of  transition  from  a  state  *  to  a  state  /  in  the  interval 
t  —  t^  as  t^*-  +  oo  and  t^-^  —  oo.  But  the  physically  interesting  quantity  is  the 
transition  per  unit  time  into  a  small  interval  A  Bin  the  neighbourhood  of  this  final 
state  Ep  If  we  take  the  density  of  final  states  to  be  Q{E)dE  then  the  transition 
probability  for  the  infinite  interval  is  given  by 

co=\<f\8\iy\^Q(E)dE.  (72) 

We  have  seen  that  ^f\S\iy  can  be  written  as 

</  j^  I  *>  =  -27tid{Ef  -Ei)<f\T\iy,  (73) 

i.e.  we  have  separated  out  the  delta  function  singularity.  Thus 

\<.f\8\i  >  p  =  47t%d{E, - Et)f  </|  r  |0  P  (74) 

and  the  transition  probability  is  given  by 

co  =  4.7z^{d{Ef-Ei)Y<f\T\iyQ(E)dE. 

We  can  interpert  the  term  |(5  {E^  —  E^)Y  as  follows : 

t 
{S{Ef-Ei)f  =  d{Ef-Ei)   lim    J- re^(^»-^VH«-Od(^  -  y 

to 

S  {Ef  -  Ei) 


2  7t  t-to->oo 


/• 


lim      /  dr 

<0 


=  ^(-^/      %)_  (^  _  g  iij^f_  t^_^^^  (75) 

2  Ji 

Thus  the  transition  probability  during  the  time  interval  t  —  ^^  as  ^  -->  +  ^^  and 
Iq^-  —  oo  is  infinite.  However  what  we  are  interested  in  is  the  transition  proba- 
bility per  unit  time  which  is  given  by 
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2.  PERTURBATION   THEORY  OF  THE  DIRAC  ELECTRON^ 

Feynman  kernel  and  its  momentum  representation 

We  shall  discuss  a  perturbation  method  of  obtaining  the  solution  of  the  Dirac 
equation 

i^=(H,  +  H')y,  (1) 

where  Hq  is  the  free  particle  Dirac  Hamiltonian  (a  -  p  +  ^m)  and  W  is  the  inter- 
action term  which  we  are  going  to  treat  as  a  perturbation.  If  the  interaction  term 
is  due  to  an  electromagnetic  field,  W  is  identified  with  e  («  A  +  ^q))  where  A 
and  q)  are  the  vector  and  scalar  potentials  of  the  electromagnetic  field. 
Alternatively  the  Dirac  equation  for  a  free  particle  perturbed  by  an  electromagnetic 
field  can  be  written  as 

(p  — eA  — m)^  =  0.  (2) 

We  identify 

W^-e^k.  (3) 

The  extension  of  the  kernel  function  formalism  to  the  Dirac  equation  is  made 
in  the  first  instance  by  replacing  the  single  component  wave  function  by  the 
Dirac  wave  function  and  representing  the  interaction  term  H'  by  e^K.  Since  it 
is  customary  to  use  for  the  interaction  eh,  the  matrix  p  is  attached  to  the 
kernel  which  is  defined  as : 

K,(2,\)=-ZV>m('i)vlW^==   2  ¥l'fmW     for     k>t, 

=  0     for     ^2<^i  (4) 

where  the  sum  m  extends  over  both  positive  and  negative  energy  eigenstates. 
The  kernel  satisfies  the  inhomogeneous  equation 

(^V-  m)  Ko{2,  1)  =  id^2,  1)  (5) 

where  (5*  (2,  1)  inplies  a  Dirac  delta  function  in  both  space  and  time  variables. 
The  kernel  ^^0(2,  I)  is  a  4  x  4  matrix  since  the  ^'s  have  four  components.  Such  a 
definition  of  the  kernel  leads  to  a  difficulty  not  met  with  in  the  Schrodinger 
case.  This  arises  from  the  fact  that  the  Dirac  equation  being  relativistic,  there 
exists  a  negative  energy  solution  corresponding  to  each  positive  energy  solution. 
Any  free  particle  solution  ^(2)  at  time  ^2  can  be  obtained  from  the  wave  function 

W{2)=lK,{2,  l)^T^(l)d3a:i.  (6) 

This  means  that  if  we  have  a  wave  function  with  positive  energy  at  t^ ,  it  w^ill  be 
propagated  to  ^2  with  positive  energy.  The  same  applies  to  an  initial  negative 

1  R.  P.  Feynman,  Phys.  Bev.  76,  749  and  769  (1949).  See  also  for  example,  the  semi- 
expository  paper  by  Alladi  Ramakrishnan  and  R.  Vasudevan  on  "  Perturbation  Expansions 
and  Kernel  functions  associated  with  single  particle  wave  functions",  Proceedings  of  the 
Summer  School  of  Theoretical  Physics,  Musoorie,  India  (1959). 
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energy  state.  If  we  postulate  only  the  existence  of  positive  energy  particles, 
the  presence  of  negative  energy  wave  functions  in  the  kernel  does  not  lead  to 
difficulties  when  there  is  no  perturbation  since  the  particle  continues  to  be  in 
the  positive  energy  state  if  once  it  is  in  a  positive  energy  state.  However  in  the 
presence  of  interactions,  a  transition  to  a  negative  energy  state  can  take  place 
and  since  the  negative  energy  states  are  also  propagated,  we  now  have  a  finite 
probabihty  of  realising  a  negative  energy  state  at  a  later  time.  This  is 
physically  impossible.  The  way  out  of  this  difficulty  was  found  by  Feynman  who 
defined  a  modified  kernel  as  follows : 

Kf(2,  1)=    i?   vSVl*"         for     h>t^ 

m,E>0 

=  ~    E    WU'V'J'^     for     (,<<!.  (7) 

m,E<0 

This  kernel  was  obtained  by  Feynman  by  imposing  two  requirements : 

(1)  Only  positive  energy  states  are  propagated  "forward  in  time". 

(2)  The  modified  kernel  should  be  a  solution  of  the  inhomogeneous  equation. 
Both   these    are   satisfied   if  we  remove  from  ^q  (2,  1)  the  sum    J]    y^m(2)^S^ 

m,E<0 

the  latter  being  defined  both  for  positive  and  negative  values  of  t  and  is  there- 
fore a  solution  of  the  homogeneous  equation,  i.e.  with  d^{2, 1)  replaced  by  zero. 
The  negative  energy  states  are  propagated  "backward  in  time"  but  the  propagator 
for  ^2  <  h  is  obtained  not  merely  by  going  over  from  positive  to  negative  energy 
but  by  adding  a  minus  sign.  This  minus  sign  is  closely  connected  with  the  Pauli 
principle  as  we  shall  see  presently. 

We  shall  now  establish  the  correspondence  between  negative  energy  states 
occurring  in  the  kernel  function  and  the  positron  formalism  of  Dirac.  As  already 
mentioned  in  our  discussion  of  the  Dirac  equation  it  was  postulated  by  Dirac 
that  all  negative  energy  states  are  filled  and  Pauli's  exclusion  principle  forbids 
a  first  order  transition  of  a  positive  energy  electron  to  a  negative  energy  state. 
The  vacuum  with  no  observable  charge  is  therefore  to  be  interpreted  as  the  nega- 
tive energy  "sea"  of  electrons.  However  if  due  to  a  perturbation,  one  of  the  electrons 
in  the  negative  energy  state  ( — E,p)  jumps  to  a  positive  energy  state  (-\-E',p') 
the  hole  created  by  the  negative  energy  electron  is  to  be  identified  with  a  positron 
of  positive  energy  4  E  and  momentum  — p.  Thus  in  the  Dirac  theory  we  deal 
with  only  positive  energy  particles,  i.e.  electrons  and  positrons  and  the  above 
transition  amounts  to  the  creation  of  a  pair.  The  first  order  process  is  not  physi- 
cally realisable  due  to  the  requirement  of  simultaneous  conservation  of  energy 
and  momentum.  However  a  second  order  process  is  possible  if  we  assume  that  this 
hole  is  occupied  by  the  incident  positive  energy  electron.  To  describe  this  in  the 
perturbation  expansion  we  should  envisage  states  with  electrons  and  posi- 
trons with  their  numbers  changing  at  every  vertex  where  the  perturbation  acts. 
Since  time  has  to  be  ordered  in  the  expansion,  the  above  second  order  process 
is  characterized  by  a  perturbation  at  ^^  where  a  pair  is  created  and  at  t^it^  >  ^i) 
where  the  positron  is  annihilated  along  with  the  incident  electron.  Thus  for  times 
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t  <i  ti  we  have  a  single  electron,  for  t  between  t^  and  t^  a  pair,  i.e.  an  electron 
and  a  positron  besides  the  incident  electron  and  for  ^  >  ^g  ^^^  electron  which  was 
created  at  t^  along  with  the  positron.  If  we  insist  on  the  time  ordering  in  the 
perturbation  it  is  obvious  that  we  cannot  directly  take  over  our  formalism  of 
single  particle  wave  functions  and  we  will  be  constrained  to  use  the  theory  of 
quantized  fields  to  be  described  later. 

However  the  use  of  the  kernel  function  Kp  of  Feynman  in  the  perturbation 
expansion  takes  into  account  all  these  while  retaining  the  single  particle  formalism. 
If  at  a  vertex  {k),  the  electron  jumps  from  a  positive  energy  state  to  a  negative 
energy  state,  it  is  propagated  by  the  kernel  Kp  to  the  vertex  (k  +  I) 
with  tjg  +  iKtjg.  The  transition  to  a  negative  energy  state  at  tj^  corresponds  in 
the  Dirac  theory  to  the  destruction  of  a  positron  which  must  have  been  created 
earlier  at  ^a;  +  i  <  ^A;-  The  propagation  of  a  negative  energy  electron  to  an  earlier 
point  tj^+i  is  equivalent  to  the  propagation  of  a  positive  energy  electron  from  ^;^  +  j^ 
to  tjg.  This  is  realised  by  noting  that  the  term  E{tj^  —  tj^^i)  =  \E\{tjg^i  —  tj^) 
if  E  is  negative,  the  spinors  with  negative  energy  electrons  being  interpreted  as 
the  spinors  of  the  positive  energy  positron.  In  a  somewhat  picturesque  manner 
we  can  say  that  "an  electron  going  backward  in  time  looks  like  a  positron  going 
forward  in  time".  If  two  time  points  ^i,  ^2  are  represented  by  A  and  B(t2  <  t-^), 
the  state  at  J5  of  a  free  electron  starting  at  A  and  travelling  backward  in  time 
to  B  is  identical  with  the  state  at  A  of  a  positron  starting  from  B  and  travelling 
forward  to  A. 

The  occurrence  of  the  negative  sign  before  the  sum  over  negative  energy  states  in 
the  kernel  Kp(2,  1)  for  t^  <  f^  meets  the  demand  of  the  hole  theory  that  the 
amplitude  should  have  its  sign  reversed  in  considering  the  double  scattering 
of  an  electron  with  the  creation  and  annihilation  of  a  positron  in  the  intermediate 
state  since  such  a  scattering  amounts  to  an  exchange  of  the  incident  electron  with  an 
electron  in  the  "sea".  Thus  the  requirement  of  the  Pauli  principle  is  met  by  the 
use  of  the  Feynman  kernel  with  the  negative  sign  attached  to  the  sum  over  the 
negative  energy  states. 

There  is  an  alternative  way  of  convincing  ourselves  that  Kp(2,  1)  should  be 
the  correct  form  of  the  kernel  based  upon  arguments  in  extending  perturbation 
theory  to  the  case  when  solutions  are  traced  back  in  differential  equations. 
This  leads  to  a  kernel  which  is  identical  with  K^  except  for  the  omission  of  the 
negative  sign  and  the  ascription  of  the  negative  sign  to  the  perturbation  at  the  vertex 
^2  if  ^2  <  ^1  •  Since  Feynman's  arguments  have  been  accepted  as  a  part  of  quantum 
electrod3mamics  we  need  not  interrupt  our  discussion  of  that  formulation  by 
these  alternative  arguments. 

The  Fejmman  kernel  in  configuration  representation  has  no  explicit  expression 
except  as  a  sum  of  an  infinite  number  of  terms.  However  we  shall  show  that  the 
4-dimensional  transform  of  the  kernel  is  expressible  in  an  elegant  operational  form 
and  fortunately  it  is  only  the  transform  that  is  required  in  calculations. 

To  realise  the  significance  of  this  we  first  indicate  how  the  free  particle  kernel 
in  momentum  representation  is  obtained.  The  free  particle  eigenfunctions  ipni^) 
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are  of  the  form  u^^  {p)  e~*^  "^  where  u  {p)  is  the  spinor  corresponding  to  momentum  p 
with  positive  or  negative  energy  and  spin  up  or  down.  Then  the  unperturbed 
propagator  is  given  by 

Kf{2,  1)  =  i;  KiP)  Unip)  e-^^-(^— '^^)  t^>t, 

+  E 
-E 

The  summation  over  n  becomes  an  integral  over  p  (since  E  and  p  can  take  a 
continuous  range  of  values  in  the  case  of  a  free  particle). 

ir^(2,  1)  =  2;      [  ^MP)n{p)e-iP-^d^p     for     t,>t, 

spill      J      ^  J^-'p 
Ep>0 

^      r-^e-^-^i^(jp)iZ(i>)d3^     for     t^<t^  (9)      : 

spin     J      ^J^v 


and     =  — 

spii 


^p<0 


where  we  have  set  x^  —  x^  =  x  and  used  the  Feynman  normalization  uu  =  2Ep. 
We  note  that 

2jU{p)u{p)^p  +  m  (10) 


spin 

for  both  positive  and  negative  energy  spinors. 

If  we  wish  to  normalize  the  wave  function  as  u(p)  u{p)  =  2  \Ep\  then  in  the 
expression  for  Kp{2,  I)  for  E^  <C  0  we  have  to  write  2  |^^  |  instead  of  2Ep  in  the 
denominator.  But  then 

2]uu  =  —{p  +  m).  (11) 

spin 

Thus  the  ratio      ,7-  under  the  normalization  uu  =  2  \Ep\  is  identical  with 

uu 
the  ratio  — —  under  the  normalization  uu  =  2Ep.  We  have 


1        d^p 


r  1       .       d^p      ^         t>o 

=  (,V  +  «)j-^e—^^     for  (12) 


'P 

Similarly 


Kj.{2,  l)  =  -(iV  +  m)j^e-i^- 


..    d>       .„„         <<0 


(2.)3      f-     E„<0-  <^^> 

Equivalently  we  can  write 

Kf{2,  l)^{iV+m)AF{x)  (14) 


where 

r      1 

Q-iEpt  +  ip-x 


^-("'  -fwE 


d^p 
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for  both  positive  and  negative  energy  and  times.  Noting  that 


=  hm—    /  — ~dp^  (15) 

-oo 

we  recognize  that 


d*^ 


AF{x)  =  i  fe-'P-"^— ^ — -j^z-^  (16) 

and 

KF{2,l)  =  {iV+m)AF{x)  =  ije-ip-^^^±^d^p.  (17) 

The  momentum  transforms  dji^ip^)  and  kp{p)  ofAp{x)  SindKp{x)  are  found  to  be 

dF(p')-il{p^~m^)  (18) 


and 


^Fip)  =  (p  +  m)     ,  '      ,  =  —^ .  (19) 

p^  —  m^        p  —  m 


We  could  have  obtained  the  same  result  from  the  equation 

{iV  —  m)Kp(2,l)  =  id{2,l).  (20) 

Momentum  representation  of  ^ Vis  p  and  of  ^(2,  1)  is  unity.  Taking  the  4-dimen- 
sional  transform  of  both  sides  we  have 


(p  —  m)  kpip)  =  i     or     k^ip)  = 


m  (21) 


Calculation  of  matrix  elements 


We  shall  in  all  the  electrodynamic  processes  consider  the  perturbed  system 
to  be  an  electron  and  the  perturbation  as  an  electromagnetic  field  A .  The  perturbed 
kernel  K^  (2,  1)  will  satisfy  the  equation 

{i'7^-eA-m)Ki{2,l)  =  id{2,l).  (22) 

This  admits  a  series  expansion  for  K^  in  terms  of  the  unperturbed  kernel 

Ki{2,  l)^Kp{2,  \)-ieJKF{2,  3)A(3)ir^(3,  l)d(3) 

+  i^e^lKp{2,  4)  A (4)^^(4,  3)  A (3)^^.(3,  1)  d(3)  d(4)  +  •  •  • .      (23) 

In  perturbation  problems  we  are  only  interested  in  the  matrix  element  bf^  of 
transition  from  an  initial  free  particle  state  ip^  to  a  final  free  particle  state  tpf 
both  being  definite  momentum  states: 

bfl  =^jxpf{2)  Ki  (2,  1)  ^,(1)  d(2)  d(l)  (24) 

where 

^^(1)  =  u{pi)e-^Pi''^ 

V>/(2)  =  w(^y)e+^^/-^». 
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Using  the  expansion  for  Kp ,  we  note  that  the  zeroth  order  term  vanishes  as 
y}f  and  y)^  are  orthogonal.  The  transition  amplitude  to  the  first  order  is 

(-i)/.7*,(2)eA(2)yi(l)d(l).  (25) 

Substituting  for  ip^  and  ipf  and  integrating,  we  can  reduce  the  matrix  element 
to  a  form  in  which  only  the  momentum  transforms  of  perturbations  and  kernels 
are  required.  We  shall  obtain  this  reduction  to  the  first  and  second  order  and 
generalize  the  mode  of  reduction  to  the  ^th  order. 
Defining  the  momentum  transform  of  A  {x)  as 

^{q)=jA{x)e'^'^d^x  (26) 

we  obtain  the  first  order  matrix  element 

u{Pf)z{q)u(pi)  (27) 

with 

The  first  order  transition  from  p^  to  p^  can  occur  only  when  the  four  momentum 
Pf  —  Pi^  Q.  ^^®  ^^  time  component  or  equivalently  a  (q)  must  be  the  transform 
of  a  function  of  space  only,  as  in  the  case  of  the  Coulomb  potential.  The  electron 
changes  its  momentum  alone  without  picking  up  any  energy  from  the  Coulomb 
field.  If  however  the  perturbation  is  due  to  a  photon,  the  first  order  matrix 
element  vanishes  since  Pf  ^  q  +  Pi  where  we  substitute  A  {k)  for  {47ie^f^-  ee*^'^. 
However  the  second  order  matrix  element  is  non- vanishing  and  is  given  by 

{-i)^lfiPf{2)eA{2)KF{2,l)eA{l)y^i{l)d{l)d{2).  (28) 

We  now  substitute  for  ^^(2)  and  ^^(1)  the  free  particle  wave  functions  and  write 
the  perturbation  A(l)  as 


^^^'^  =  -(^J^^^'^^~'''^^'^ 


(29) 


and  the  kernel  as 

_J__ei^-(^.-^.)-i!^.  (30) 

p  —  m  (2  Tir 

Integrating  over  the  space  time  points  (2)  and  (1)  we  get 

{-i?\\e2i{q^)--—- — ea(gi)2^i     J^    .  (31) 

J.^  Pi  +  qi  —  m  [inr 

When  the  perturbations  are  due  to  photons,  we  have 

a(gi)==eV47r,  (32) 

and  the  matrix  element  is  given  by 

M  =  U9Qc, ei%.  (33)    ; 
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When  one  of  the  perturbations  is  a  Coulomb  potential,  then 

.ea(?)  =  — ^5(g4)  =  v(qf)  (34) 

and 

if^(-yfaep^^;_^vW.,^.  (35) 

When  both  the  perturbations  are  potentials,  we  have 

M  =  (-«)^/e.v(,,)-^-^i— -v(«,)«,^. 


(36) 


The  integration  over  the  space -time  variables  corresponding  to  the  initial  and 
final  wave  functions  puts  the  terms  of  the  matrix  element  in  the  momentum 
representation  with  the  arguments  of  the  four  momenta  associated  with  each 
term  depending  upon  the  initial  and  final  momentum  states  and  the  perturbation, 
i.e.  it  is  not  the  kernel  that  occurs  in  the  matrix  elements  but  only  its  four- 
dimensional  transform.  If  p  is  the  momentum  of  an  electron  which  absorbs  a 
photon  of  momentum  q,  the  transform  of  the  propagator  with  energy-momentum 

argument  {'p  +  q)  is 


p  +  q  —  m 

It  is  interesting  to  examine  what  happens  if  in  the  Feynman  perturbation  ex- 
pansion we  integrate  with  respect  to  space  and  time  separately,  i.e.  we  wish 
to  obtain  an  expression  for  the  4-dimensional  transform  of  the  kernel  by  first 
integrating  with  respect  to  space  and  then  with  respect  to  time.  Integrating  with 
respect  to  space  amounts  to  picking  out  from  the  kernel  the  terms  corresponding 
to  momentum  p  +  q  and  energy  E(p+  q)  which  can  take  two  values,  i  \E{p-{-q)  |. 
When  integrating  with  respect  to  time  from  —  oo  to  +  oo,  we  must  use 
-r  \E{p  +  q)  I  for  the  range  0  to  +  oo  and  —  \E{p  +  q)\  for  the  range  — oo  to  0. 
Performing  the  integration,  we  have 

+  - 

P+m  P+m 


l\E{p  +  q)\(E{p)  +  E{q)-\E{p-^q)\)       \E{p  +  q)\{E{p)-^E{q)+\E{p-\-q)\} 

(37) 
+ 
where  P  =^  p  -\-  q  and  P  implies  that  we  have  used  -\-  \E(p  +  ^)  |  as  the  coefficient 

of  Yi  and  P  implies  the  use  of  —  |^(p  +  g)  |  as  the  coefficient  of  y^. 

The  matrix  element  with  the  various  terms  in  the  momentum  representation 
can  be  written  down  by  describing  the  processes  by  what  are  known  as  Feynman 
graphs.  The  familiarity  with  Feynman  graphs  and  the  ease  with  which  the  cal- 
culations are  made  using  them  have  led  many  physicists  to  beUeve  that  the 
Feynman  formalism  is  intuitive  or  is  in  the  nature  of  an  algorithm  while  the  field 
theoretic  approach  is  rigorous  and  deductive.  The  author  is  of  the  opinion  that 
as  long  as  in  field  theory  interactions  are  expressed  only  through  annihilation 
and  creation  operators  (see  later  chapter  on  field  theory)  and  wave  functions  of  single 
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particles,  the  domains  of  applicability  of  the  Feynman  and  field  theoretic  descrip- 
tions are  co-extensive  as  far  as  the  perturbation  calculations  are  concerned. 

We  shall  therefore  describe  this  graphical  representation  in  detail.  For  this 
purpose,  we  start  with  the  perturbation  expansion  with  the  integrand  in  configura- 
tion representation.  For  obvious  convenience,  we  can  denote  the  space  variables 
along  one  axis  (say  x)  and  the  time  variable  along  the  other  (say  y).  The  integrand 
in  (36)  is  then  represented  diagramatically  as  follows : 


Fig.  1.  Feynman  diagram  of  an  electron  scattered  twice. 


An  electron  with  wave  function  ^^(1)  is  perturbed  at  the  space-time  point  (1) 
by  the  potential  e  A  (1)  and  then  moves  to  the  space-time  point  (2).  The  free  particle 
kernel  is  represented  by  the  line  joining  the  points  (1)  and  (2).  Again  at  (2)  the 
electron  is  perturbed  by  e  A  (2)  and  emerges  with  wave  function  ipf  (2).  On  integrat- 
ing with  respect  to  space  time  variables,  the  momentum  transforms  are  obtained  in 
the  same  sequence  as  in  the  integrand  and  therefore  we  can  still  represent  the  matrix 
element  by  the  same  graph  writing  instead  of  the  kernels  and  perturbations  their 
corresponding  transforms  and  retaining  the  indices  (I)  and  (2)  for  the  vertices. 
We  should  however  remember  that  the  vertices  merely  indicate  the  sequence  which  now 
bears  a  one  to  one  correspondence  with  the  space-time  points  in  the  integrand.  Then 
the  process  is  described  as  follows : 

The  electron  with  momentum  p,  and  wave  function  w^  moves  as  a  free  particle. 
At  point  (1)  it  is  scattered  by  a  photon  of  momentum  q^.  Having  absorbed  the 
photon,  the  electron  moves  from  (1)  to  (2)  as  a  virtual  particle  of  momentum 
p  +  q^.  It  gets  scattered  again  at  (2)  by  the  second  photon  of  monentum  qg. 
Then  it  emerges  from  (2)  as  a  free  particle  with  momentum  p2=Pi  +  qi  +  q2- 

We  see  that  there  is  no  significance  attached  to  the  two  axes  and  it  is  only  the 
sequence  of  vertices  which  is  important.  This  is  also  implied  when  we  take  the 
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propagator  as 


which  means  that  time  has  been  integrated  out  and  hence 


p  — m 

includes  double  scattering  through  the  creation  and  annihilation  of  pairs.  How- 
ever we  might  wish  to  specify  (i)  whether  the  incoming  and  outgoing  particles 
are  electrons  or  positrons,  (ii)  whether  the  photon  is  emitted  or  absorbed  at  a 
vertex.  Since  in  the  Feynman  point  of  view  a  positron  is  interpreted  as  a  negative 
energy  electron  travelling  back  in  time,  it  has  become  customary  even  in  the 
momentum  representation  to  maintain  the  time  sense  as  far  as  incident  and 
emergent  particles  are  concerned,  i.e.  the  arrows  have  to  be  pointed  upwards 
for  electrons  and  downwards  for  positrons.  Similarly,  the  photon  that  is  absorbed 
is  represented  by  a  line  attached  to  a  vertex  from  below  and  the  photon  that  is 
emitted  is  represented  by  a  line  which  is  attached  to  a  vertex  from  above. 

However  if  we  wish  to  retain  time  ordering  in  a  strict  sense  for  the  entire  process  y 
i.e.  including  intermediate  states  we  have  to  do  the  space  integration  first  and 
time  integration  subsequently.  In  this  case  only  time  has  a  definite  meaning 
while  the  other  axis  is  necessary  only  for  diagrammatic  representation  and  has 

no  significance  as  a  space  axis.  Hence  we  split  the  propagator 

where  " 

p  +  m 


as  A^-^  A 


A    = 


\E(P  -f-  q)\[E{p)  +E{q)-  \E{p  +  g)|] 

-  (P  +  m) 

E  {p  +  q)\[E  {p)  +  E  {q)  -\-\E  {p-^  q)\] 


(38) 


(39) 


Fig.  2.  Diagrammatic  representation  of  propagators  ^+  and  A. 


and  the  corresponding  graphs  for  A+  and  A_  are  given  above  (Fig.  2) .  We  have  thus 
seen  that  the  Feynman  formalism  is  essentially  characterized  by  two  features: 
(1)  even  when  pair  creation  and  annihilation  of  particles  are  included,  electro - 
dynamic  processes  can  still  be  described  through  a  single  particle  formalism  which 
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obviates  the  use  of  field  theoretic  methods  and  (2)  the  perturbation  expansions 
are  "inherently  four-dimensional"  and  the  covariance  of  the  equations  is  apparent 
at  every  stage  of  the  calculations,  the  integration  over  space-time  variables 
being  performed  together.  However  the  elegance  of  the  four-dimensional  methods 
is  still  maintained  even  if  we  integrate  the  space  and  time  variables  separately 
which  leads  to  the  decomposition  of  the  Feynman  propagator.  This  leads  to  a 
better  understanding  of  virtual  processes.  We  shall  indicate  the  centribution 
to  the  matrix  element  from  transitions  to  negative  energy  virtual  states  in  electro- 
dynamic  processes  which  will  be  considered  in  the  next  section. 

We  shall  now  summarize  the  method  of  writing  the  matrix  element  correspond- 
ing to  a  Feynman  graph  by  giving  below  what  are  known  as  the  Feynman  rules. 

(i)  An  electron  in  a  virtual  state  of  four  momentum  f  contributes  an  amplitude 

*/(p  -  ^)- 

(ii)  The  amplitude  for  picking  up  a  momentum  q  from  a  potential  is  — ie2i{q) 
where  a  (q)  is  the  momentum  transform  of  A  (x)  representing  the  potential  which 
is  independent  of  time : 

A{x)=ja{q)(i-^'i-^d{q^)^^q.  (40) 

In  particular  when  A^,  =  0  for  ^  =  1,  2,  3  and  A/^  =  q)  = 

A  real  photon  of  polarization  e  contributes  the  factor  a(g)  ^^  (471)'/*  e  where 
a^  =  (4  jr)'/«e^ .  This  is  because  the  photon  wave  function  is  taken  to  be  4  jr'^^c^e** '^ . 
If  we  have  a  photon  of  polarization  e^  we  set  e^  =  1  and  a^  =^  (471)'/" . 

(iii)  All  indeterminate  momenta  q^  are  summed  over  d^gj(27r)*.  The  matrix 
element  obtained  using  the  above  rules  has  to  be  multiplied  by  the  normalization 

factors  -rj^ —  for  a  photon  and     ,    ^  for  a  real  electron. 


3.  APPLICATIONS  TO  ELECTRODYNAMICS 

We  shall  now  consider  the  following  electromagnetic  processes^, 

(1)  Compton  effect, 

(2)  Two  photon  pair  annihilation, 

(3)  Bremsstrahlung, 

(4)  Pair  creation, 

(5)  Electron-electron  interaction. 

The  five  processes  have  been  chosen  in  this  order  for  convenience  of  comparison 
of  the  corresponding  Feynman  diagrams.  Besides  these  there  are  two  other  im- 

^  R.  P.  Feynman,  Lectures  delivered  at  the  California  Institute  of  Technology,  on  "Quan- 
tum Electrodynamics"  (1953).  For  discussions  on  the  cross-sections,  seeW.HEiTLER,  Quantum 
Theory  of  Radiation,  Oxford  University  Press,  London  (1954). 
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portant  electrodynamic  processes — (1)  the  loss  of  energy  of  fast  particles  by  ioni- 
sation,  (2)  loss  of  energy  by  magnetic  Bremsstrahlung.  Since  these  processes 
can  be  satisfactorily  considered  by  the  use  of  semi-classical  methods,  we  shall 
postpone  their  discussion  to  a  later  stage  (see  Part  II). 


Compton  effect 

This  is  the  process  of  scattering  of  a  photon  by  a  free  electron.  A  photon  of 
momentum  q^  (q^ ,  coj)  and  polarization  Ci  is  incident  on  an  electron  which  is 
assumed  to  be  at  rest,  since  in  the  rest  system  of  the  electron  the  calculation  of  the 
cross-section  is  greatly  simplified.  The  general  case  when  the  electron  is  not  at 
rest  can  be  obtained  from  this  by  a  Lorentz  transformation.  In  the  final  state, 
we  have  a  photon  of  momentum  q^{q^,  CO2)  ^nd  polarization  €2  and  an  electron 
of  momentum  p2  {p2 ,  ^2)-  From  the  Feynman  formalism  we  can  regard  the  process 
as  follows.  The  incident  photon  q^  is  absorbed  by  the  electron  of  momentum 

i 
Pi{0,  m).  Absorbing  the  photon,  the  electron  propagates  itself  as  ^ . 

Subsequently  it  emits  a  photon  of  momentum  ^2  ^he  electron  having  a  residual 
momentum  ^g-  The  conservation  of  energy  and  momentum  is  expressed  by  the 
relation 

Pi  +  qi=P2  +  q2-  (1) 

Instead  of  absorbing  the  incident  photon  first,  the  electron  may  emit  the  final 
photon  and  subsequently  absorb  the  incident  photon.  Thus  there  are  two  possi- 
bilities which  are  represented  by  the  diagrams  below  (Fig.  3).  To  calculate  the 


Pi+«ii 


(R)  (S) 

Fig.  3.  Feynman  diagrams  for  Compton  effect. 

matrix  element  for  the  transition  from  the  initial  to  the  final  state,  we  represent 
the  incoming  photon  by  ^1^=  (47re2)*/*ei^e~**^'^  and  the  emitted  photon  by 
^2/«  =  (4  7re2)V2e2,,e+^*^"^  where  the   normalization  is  to  {2  m)  photons  per  unit 
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volume.  Further  since  the  photon  is  polarized  perpendicular  to  the  direction 
of  propagation  we  have 

ei-gi-0;     Cg-ga^O.  (2) 

Also  ql^  0  and  ql  =  0.  (3) 

The  initial  and  final  state  electron  wave  functions  are  chosen  as 

where 

Pi'Pi  =  m^;     p2-p2=--mK  (5) 
Since  the  initial  electron  is  at  rest  we  have 

Pi  =  *^n ;     P2  =  ^271  —  Yx  P2  cos  (p  +  Yy  P2  sincp .  (6) 

qi  =  ^1  irt  —  Yx) ;     q2  =--  0J2  iyt  —  yx  cos  d  —  yy  sin  d)  (7) 

where  d  is  the  angle  between  the  outgoing  and  incoming  photon  directions  (initial 
photon  direction  is  taken  as  the  x  direction)  and  2n  —  (p  is  the  angle  which  the 
recoil  electron  makes  with  the  x  direction.  The  possible  directions  of  polarizations 


e 
Fig.  4.  Schematic  diagram  for  Compton  effect. 

of  the  incident  photon  will  be  along  the  z  or  y  directions  denoted  as  "Type  ^" 
and  "Type  B''  respectively.  We  then  have 

(A)  e,  =  y,    {B)  e,  -  yy.  (8) 

Similarly  for  the  outgoing  photon  we  have 

{A')  62  =  yz    {B')  62  =  yy  cos  d  -  y^  sin  d.  (9) 

Substituting  relations  (2)  to  (7)  in  (1)  we  can  obtain  the  formula  for  Compton 
shift  in  frequency  as 

J!L_J!L=i_cose.  (10) 

Wo  CO, 
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In  computing  the  transition  probability  the  density  of  final  states  involved  is 
that  of  a  two  particle  system.  This  is  given  by 

flaking  use  of  the  Feynman  rules  stated  in  the  previous  section,  the  matrix 
elements  for  the  diagrams  R  and  S  are  respectively 


and 


—  i{4:7t  e^)  I U2  62 ^1  ^1 )  =  —  *  4  JT e^  U2E  % 

\  Pi  ^~  ^1      ^  / 

—  i{4:7i e^)  I IZ2 e. e, % )  =  —i4:ne^u^8u.. 

\        Pi-q2-^       / 


(12) 
(13) 


Hence  the  differential  cross-section  for  Compton  effect  is  given  by 


Making  use  of  the  identities 

1  p  +  m 


(15) 


p  —  m       p"  —  m" 

and 

ab-2a.6-ba  (16) 

and  the  relations  (8  and  9)  stated  above,  it  can  be  shown  thati?  and>S  reduce  to 

R  =  ^3l^     and     S  =  ^^^.  (17) 

2mft>i  2mft>2 

The  matrix  elements  corresponding  to  the  different  sets  of  polarisations  A  A', 
AB',  BA'  and  BB'  can  then  be  simplified.  The  final  results  are  given  below. 

Polarization  \M_^ 

AA'  Kzi!^  +  4, 

(O1CO2  ' 

AB'  (^1  -  (^2)^ 


G>ift)2 
0)^  COo 


BA 

Oj-^  fjj2 

BB'  <«'x-"^)V4cos^e. 

0)^0^2 

More  generally  we  can  write  the  matrix  element  as 


^         K-<»,)^^  (18) 
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The  cross- section  for  scattering  with  prescribed  directions  Ci  and  e^  of  polarization 
of  the  incoming  and  outgoing  photons  is 


da  =  — ^--f  di?, 
4m^     CO? 


COi 
COo 


(19) 


This  is  the  well-known  Klein-Nishina  formula. 


Discussion  of  the  Kleiyi-Nishina  formula 

In  the  non-relativistic  limit,  co^  <  m  the  electron  picks  up  very  little  energy  and 
hence  co^  ^  cog  •  The  Klein-Nishina  formula  then  takes  the  form 

da  =  -^{e^'e^fdQ^^  (20) 

which  is  the  Rayleigh-Thomson  scattering  cross-section. 

To  obtain  the  scattering  cross-section  independent  of  polarization  we  have 
to  sum  over  the  polarizations  of  the  outgoing  beam  and  average  over  the  incoming 
beam  which  can  be  seen  in  the  following  way.  The  total  probability  for  scattering 
a  photon  of  type  A  to  one  of  the  type  A'  or  B'  is  (A  A'  +  AB').  Since  the  incoming 
beam  can  be  of  type  A  or  B  with  equal  probability  the  cross-section  for  an 
unpolarized  incoming  beam  is 

J{^^' +  ^^'}  +  H^^' +  ^^'} 

=  ^l^(M^di2.l^  +  ^-sin^4.  (21) 

On  the  other  hand  if  the  outgoing  beam  is  taken  to  be  of  definite  polarization,  we 
have,  if  the  incident  beam  is  unpolarized, 

Probabihty  for  polarization  of  type  A'        \{AA'  +  BA') 
Probability  for  polarization  of  type  B'        \{AB'  +  B  B') 

= ^'^^ (22) 

i^  +  ieL_2sin^e. 

The  unscattered  radiation  {6  =  0)  remains  unpolarized  but  the  scattered  radiation 
will  be  polarized  to  a  certain  extent.  When  0)^  r^  cOg  (i.e.  in  the  lower  frequency 
limit),  the  incoming  unpolarized  beam  becomes  plane  polarized,  when  scattered 
through  90°.  If  the  incident  beam  is  polarized  for  large  angles  the  outgoing  beam 
is  unpolarized  and  for  small  angles  it  is  almost  unpolarized  as  0)2! o)  is  negligible. 
To  obtain  the  total  scattering  cross -section,  integration  over  all  angles  should 
be  performed.  Hence,  integrating,  the  total  cross-section  is 

a        3(1+7 


Oq       4  I    )/3 


'''-''''     -log(l-f2,) 


1  +  27 
where  Stz  o)^ 


+  ^Iog(H-2,)-^i±|^)    (23) 
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In  the  non-relativistic  case,  y  <  l,G  =  aQ.  The  right-hand  side  reduces  when 
expanded  in  powers  of  y  to 

+  —  /  +  •••)•  (24) 


a  =  <To(l  — 2^ 


In  the  extreme  relativistic  case 


2a)i 


log ^+1  . 

y  Mx\        m        V 


(25) 


Hence  for  Yevy  high  energies,  the  number  of  quanta  scattered  decreases  with 
increasing  energy  of  the  incident  radiation.  This  is  the  reason  for  the  increase 
of  penetrating  power  of  y-rays  with  frequency  as  long  as  there  is  no  other  ab- 
sorption process  such  as  pair  production. 

Two  photon  pair  annihilation 

In  this  process  a  free  electron  is  annihilated  in  a  collision  with  a  free  positron. 
Conservation  of  energy  and  momentum  requires  that  two  photons  be  emitted. 
In  Feynman  formalism,  this  phenomenon  is  completely  analogous  to  Compton- 
scattering  as  the  diagrams  below  show. 


•i      p-qi       92 


•2     p-q?      «! 


(R)  IS) 

Fig.  5.  Feynman  diagrams  for  two  photon  pair  annihilation. 


The  diagrams  differ  from  those  of  Compton  effect  only  in  the  ordering  in  time. 
The  direction  of  q,  is  reversed  and  the  second  particle  is  a  positron,  i.e.  an  electron 
travelling  back  in  time.  The  electron  of  momentum  ^1  (p^ ,  E-^  emits  a  photon  of 

momentum  ^1(^1,  co^)  and  polarization  d  and  propagates  itself  as -. 

Pi  ~"  ^1  ~  ^ 
It  collides  with  a  positron  of  momentum  p^ip^^  ^2)  resulting  in  the  annihilation 
of  the  pair  with  the  emission  of  a  photon  of  momentum  ^2(^2'  ^^2)-  The  second 
diagram  is  obtained  by  the  interchange  of  the  two  photons.  The  conservation 
law  gives 

P2==Pi-qi-q2  (26) 
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which  is  similar  to  that  for  Compton  effect.  Denoting  the  matrix  elements  of 
the  two  diagrams  as  M^  and  if  2  we  have 

^7Z — - — r^eiJ  (27) 


and 


Jfi  =  —i4:7ie^ 


M2=  —i  4:716^ 


(Pi— qi-^) 
1 


u^ei 


m 


62^1 


(28) 


where 


Pi— q2 

Pi= (^-  rt-P-'  7)      p2=— (^+  n  -p+-  r). 

To  calculate  the  cross-section,  we  apply  the  methods  as  before.  The  electron 
is  assumed  to  be  at  rest  while  the  positron  is  moving.  The  density  of  the  two 
photon  final  states  is  given  by 


CDiCOo 


(27r)3    (o^oy^-q^-q^ 


(29) 


From  the  diagrams  it  is  clear  that  the  matrix  elements  for  pair  annihilation 
are  the  same  as  the  matrix  elements  for  Compton  effect  if  the  sign  of  q^  is  changed. 
In  the  cross-section  this  necessitates  a  change  of  sign  of  co^.  Hence  the  cross- 
section  is 


e^cofdi?. 


4m2(^^  +  m)!^+|    I  CO 


CO 


1  <^2 


4(ei.  62)^1 . 


(30) 


Polarization:  As  in  the  case  of  Compton  effect,  to  consider  the  variation  of 
the  cross-section  with  the  polarization,  it  will  be  more  convenient  to  work  in  a 
coordinate  system  in  which  the  centre  of  mass  of  the  electron  and  positron  is 
at  rest.  Then  the  differential  cross-section  is  given  by 

e*      di^ 


dcr 


rv^ 


E. 


El  -  (^2  _  2pl  cos20)  {e, .  e^r  +  4(p^  •  ej  {p,  •  e^)  (e^  •  e^) 


HP. 


El  —  plGos^d 
^ifiP^-e^y 


{El—plG08^6f 


(31) 


The  polarization  vectors  Ci  and  62  can  be  so  chosen  that  they  are  in  the  p+  q^^ 
plane  or  perpendicular  to  it.  In  accordance  with  this,  the  cross-section  will  be 
denoted  by  dcr||||  when  both  photons  are  polarized  in  this  plane  and  by  dcj^n 
when  one  of  them  is  polarized  in  this  plane  and  the  other  perpendicular  to  it. 
When  p_  is  very  small  dor^^  =  d(j|  1 1 1  =  0  and 

e^dQ 

dcr,  11= (non-relativistic  case) .  (32) 

"       yp^m 

Thus  the  photons  are  polarized  perpendicular  to  each  other.  For  ^q  >  m  no  polari- 
zation effect  exists. 

Integrating  over  0  (0  to  n)  and  cp  we  obtain  the  total  probability  of  annihilation. 
Limits  of  6  are  0  to  tt  since  permutation  of  the  two  photons  does  not  lead  to  a 
new  state. 
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The  integration  jdelds 


where 


_^i.„._,,-^,,i±i, 


"=!:-•■ 


The  above  expression  represents  the  cross-section  for  the  annihilation  of  an 
electron  and  positron  of  equal  and  opposite  momentum  ^^. .  In  the  practical  case, 
the  electron  is  at  rest.  The  probability  of  annihilation  in  this  case  is  obtained  bj^ 
a  suitable  Lorentz  transformation  which  gives  a  as 

''^  +  '''+^ogy+y(,^_l)_#^}  (34) 


where  y  =  E+jm  and  E+  and  v+  are  the  energy  and  velocity  of  positron  and 
Tq  =  e^jm.  The  above  expression  first  deduced  by  Dirac  has  a  maximum  for  small 
values  oiv^(y  ^  1)  and  for  E^  ->m  the  cross-section  diverges.  This  does  not  mean 
that  the  cross-section  Avill  become  infinity.  The  rate  of  annhilation  in  a  substance 
with  'N  atoms/unit  volume  is  given  by  i?  =  NgZv+  ==  NZjir^  sec"^  (in  the  non- 
relativistic  case).  For  lead,  R  =^  2  x  10^^  sec ~^.  Hence  the  life  of  a  very  slow 
positron  in  lead  is  about  10~^*^  sees.  For  higher  energies  the  cross-section  decreases 
and  is  given  by 


jirl  ^p-  jlog-p^-^^ 1  [         (in  the  extreme  relativistic  case) .  (35) 


In  a  Lorentz  frame  in  which  the  electron  is  at  rest,  generally  the  two  quanta  do 
not  have  the  same  frequency.  For  high  energies  of  the  positron  in  the  initial  Lorentz 
system,  the  two  quanta  emitted  are  mainly  in  the  forward  and  backward  direc- 
tions. After  the  Lorentz  transformation  the  quantum  emitted  forward  takes  up 
most  of  the  energy  of  the  positron  and  the  second  quantum  has  energy  only  of 
the  order  of  m.  If  however  the  positron  has  small  kinetic  energy,  both  quanta 
pick  up  almost  equal  energies  (^-m)  and  are  emitted  in  the  opposite  directions 
with  perpendicular  polarizations. 

Bremsstrahlun^ 

An  electron  traversing  in  the  field  of  a  nucleus  is  deflected,  according  to  the 
classical  theory,  due  to  the  inverse  square  law  force  of  attraction.  Since 
deflection  involves  acceleration  according  to  classical  electrodynamics,  the 
electron  should  radiate.  This  process  is  called  "Bremsstrahlung".  In  the 
language  of  quantum  electrodynamics,  there  exists  a  definite  probability  that  the 
incident  electron  wdll  make  a  radiative  transition  to  another  state  with  the  emis- 
sion of  a  photon.  Interaction  with  the  field  of  the  nucleus  is  necessary  to  conserve 
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energy  and  momentum  since  it  is  impossible  for  a  free  electron  to  emit  a  single 
photon  and  make  a  transition  to  a  real  state. 

The  Fejmman  diagrams  for  Bremsstrahlung  are  given  below. 


V(Q) 


p,  +  Q=P2+a 


(R) 


(S) 


Fig.  6.  Feynman  diagrams  for  Bremsstrahlung. 


Fig.  7.  Schematic  diagram  for  Bremsstrahlmig. 


It  is  assumed  that  the  nucleus  is  so  heavy  compared  to  the  electron  that  it 
is  at  rest  throughout  the  process.  This  assumption,  as  will  be  shown  later,  is 
equivalent  to  assuming  that  the  electron  interacts  only  with  the  Coulomb  field 
of  the  nucleus.  The  two  diagrams  describe  the  following  indistinguishable 
processes. 

(1)  The  electron  interacts  with  the  Coulomb  field  and  subsequently  emits 
a  photon, 

(2)  The  electron  emits  a  photon  and  then  interacts  with  the  Coulomb  field. 
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In  the  first  diagram,  an  electron  of  momentum  p^  initiallj^  in  a  state  described 
by  the  wave  function  Uj^e~^^^'^  is  scattered  by  the  Coulomb  potential,  acquiring 

an  additional  momentum  Q.  It  then  moves  as -^^r until  it  emits  a  photon 

Pi  +  Q  -  m  ^ 

of  momentum  q  and  polarization  e.  The  final  state  of  the  electron  of  momentum 

P2  is  ?^2^"*^'"^-  The  matrix  element  corresponding  to  this  diagram  is 

1 


where 


Pi  +  Q  — w 


w{Q)u^ 


(36) 


'^iQ)-—Q^^iQ^)yt  =  HQ)^{Q,)rt^  (37) 

It  should  be  noted  that  the  Coulomb  potential  Ze^/r  is  independent  of  time 
and  hence  the  Fourier  transform  of  Ze'^/r  is  proportional  to  ^(^4).  This  implies 
that  ^4=0  for  the  transition  and  hence  E^  =  E^-]-  o).  The  second  diagram  can  be 
similarly  understood  and  its  matrix  element  Jf  2  is  given  by 


M^  =  {4:71  e^yl- 


(38) 


P2 — Q  —  w 

The  energy  momentum  conservation  requires 

P2=Pi  +  Q-q.  (39) 

The  density  of  final  states  in  this  case  corresponds  to  that  of  a  three  particle 
system  consisting  of  the  nucleus,  electron  and  photon.  Since  the  nucleus  is  con- 
sidered to  be  infinitely  heavy,  the  expression  for  the  density  of  states  can  be 
reduced  to 

0  =  (2  Tty^E^p^  d^2  M^dco  dQ^.  (40) 

The  outgoing  photon  can  be  polarized  in  any  direction  and  the  incoming  and 
outgoing  electron  states  each  have  two  possible  spin  states.  Hence  after  summing 
over  the  photon  and  the  outgoing  electron  polarizations  and  averaging  over  the 
initial  polarizations  of  the  electron,  we  have  the  following  differential  cross- 
section 

1     /Ze2\2     1       dco    ^2    .    ^    ,^     .    ^    ,^    , 

^'  =  ^[^1  "m  -^  -^^^n0,de,sme,d6,d<p 

plsm^d^{4:El-Q^)       plsmW^{4:El-Q^) 


{E^-p^  cos6>2)2  {E^-Vi  cos6>,)2 

2^1  j92  sin6>i  sin  02  cos(p(4:Ei  E^  —  Q^  +  2  co^)  —  2  co^  (^2  ^1^2  6^  +  pf  sin^  l9i) 
(^2  —  P2  cos  62)  {E^  —  pi  cos  dj) 


(41) 


where  Ze  is  the  charge  of  the  deflecting  nucleus,  p^,  E^  the  initial  momentum  and 
energy  of  the  electron,  p^^,  E^  the  final  momentum  and  energy  of  the  electron 
and  CO  the  frequency  of  the  emitted  photon.  Q  is  the  total  momentum  transmitted 
to  the  nucleus  =  {Pi—  p^—  ^)^  ^i  ^  ^^^  angle  between  q  and  p^,  d^  ^^^  angle 
between  q  and  p^,  q)  the  angle  between  the  planes  {qp^l  and  {qp-^). 
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When  the  photon  energy  is  small  compared  to  the  rest  mass  of  the  electron, 
the  matrix  element  reduces  to  the  form 


M={4:7zeyi^v{Q) 


e  Pi        e  p, 


yt 


q  Pi     q  P2 

The  differential  cross-section  for  photon  emission  then  becomes 


da 


271 


2E^2E^ 


'v{Q) 


E^PzdQ^ 


{271) 


e^doj  dQ,. 


7ia> 


[pz-q/co      Pi-qlcoJ 


(42) 


(43) 


which  can  be  easily  interpreted.  The  first  bracket  represents  the  transition  pro- 
bability for  elastic  scattering  while  the  second  bracket  gives  the  probability 
of  emission  of  a  photon  in  the  frequency  interval  d  ft>  and  in  the  solid  angle  d  £2^)  if 
there  is  elastic  scattering  from  momentum  p^  to  P2 . 

The  total  cross-section  for  the  emission  of  a  quantum  with  energy  between  co 
and  CO  +  dco  is  obtained  by  integrating  dor  over  all  angles.  Usually  the  cross - 
section  is  represented  as  a  function  of  coKEj^  —  m) ,  m  being  the  rest  energy  of  the 
electron  and  E^  —  m  its  initial  kinetic  energy.  The  total  cross-section  is  given  by 


''Ay, 


(JO 


m 


=  A 


fdodQ^dQ^ 
dco   P2  f4 


CO     p^ 

8   E^    E^ 
3    Pi     P2 
m^co    /E^E, 


2F.^Ejl±f^n.^(^ 


3       —-'    plpl 


+ 


^2^1 


pI 


£ 
P1P2 


+ 


CO' 


plpl 


{Eli-El  +  plpl) 


where 


'^PlP2 


Pi 


E^  E2 


Pi 


pl 


Pt 


£2  + 


2coEj^E2 

pIpI 


)i 


1^2  \ 
hpJ        I 

\ 


(44) 


.pl  +  PiP2  —  E^o^  =  2 log  ^' ^'  -^PiP2  —  ^' 


Pl  —  PlP2  —  El(^ 

1       Ei  +  Pi       ^1      E^  +  Pi 
e^  =  log  ^f — ^  =  2  log- 


m  CO 


Ei-Pi 


£2  =  2  log 


E2  +  P2 


m 


and 


Z^rl 


137 


Discussion  of  the  cross-section 

1.  As  in  the  case  of  the  production  of  continous  X-ray  beams,  where  primary 
energies  are  small,  i.e.  in  the  non-relativistic  region,  E-^:^  E^^^  rn  we  can 
neglect  p^  and  co  compared  to  m.  The  differential  cross-section  is  then  given  by 

_  2Z^e^    dco    P2    ^ind^dB^mnd^dd^dcp 
^~  7rl37    ~^  ^  Q^ 

[p\  sin^  ^2  +  Pl  sin^  Bi  —  ^Pi'P2  sin  ^i  sin  6^  cos  99} .  (45) 
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For  a  given  angle  between  p^  and  'p^, ,  the  maximum  intensity  of  emitted  radiation 
is  perpendicular  to  the  plane  of  motion  of  the  electron.  This  corresponds  to  the 
classical  theory  where  there  is  maximum  emission  perpendicular  to  the  plane 
of  acceleration.  The  intensity  in  a  certain  direction  6j_  is  obtained  by  integrating 
the  above  expression  over  all  directions  of  p^.  The  total  cross-section  for  emission 
of  the  quantum  co  is  given  by 


/  CO  \  16    da>    m2         2^1  +  ^2 

\ToJ  3      0)      p^      "^  p^  —  p^ 


f2  ^   1-6-2"?= 

Ze^ 

1.  =  ^^- 

^1 

'^'~      V,     ' 

Pi      P1  —  P2 

where  Tq=  E^  —  m  -=  p\l2m,  the  initial  kinetic  energy  of  the  electron.  At  the 
limit  o)  =  p\l27n,  a^  appears  to  vanish  but  this  result  is  true  only  in  the  case 
of  a  pure  Coulomb  field  and  when  Born  approximation  is  valid. 

2.  For  a  low  energy  electron,  the  Born  approximation  is  no  longer  valid. 
Sommerfeld  has  computed  the  correction  factor 

(47) 

which  has  to  be  applied  to  the  total  cross-section  v^  and  v^  being  the  velocities 
before  and  ofter  collision.  This  correction  contributes  for  large  values  of  |,  and 
the  correction  becomes  appreciable  for  heavy  elements. 

3.  The  charge  distribution  of  the  outer  electrons  of  an  atom  produces  a  screening 
of  the  Coulomb  field  of  the  nucleus  which  is  appreciable  for  high  primary  energies. 
The  impact  parameter  is 

(i.e.  a  minimum  value  oiQ  leads  to  a  large  increase  in  the  cross-section)  and  for  a 

CO 

given  ratio  of -=- ,  the  higher  the  primary  energy  the  larger  is  r^ax  •  If  w  ^  -£^2 »  ^max— 
^2  137m 

radius  of  the  K  shell  for  E^  ^^ .  Thus  for  high  energies  screening  of  the 

Zi 

Coulomb  field  leads  to  a  decrease  in  the  cross -section.  Assuming  the  value  obtained 
from  the  Thomas-Fermi  model  for  the  atomic  radius, 

ar^a^Z-'l^r^mX^Z''!^  (49) 

where  a^  is  the  Bohr  radius  of  the  hydrogen  atom  and  if  fj^ax  is  large  compared 
to  a  (i.e.  in  the  case  of  complete  screening)  we  may  replace  fj^ax  ^Y  '^'  ^^'^  ^^^ 
cross-section  thus  obtained  will  be  of  the  right  order.  In  the  case  of  complete 
screening  the  formula  becomes 

21og(183Z-Vs)  +  l|  (50) 

E1E2 


iih-'-'-^MW-l 


for  ^^^^^~ 
mco        A 
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4.  In  the  extreme  relativistic  case^  (^1^2  >  ^)  ^^id  for  large  angles  I  6^  >  — -j 

the  intensity  of  radiation  emitted  in  the  energy  interval  d  co  at  a>  in  a  solid 
angle  dQ^  at  an  angle  d  is  proportional  to  co  dco^  and  is  given  by 

.d.,=^,^d(^)d^.(-j;-i_T  (51) 


where 


2>l„^2^1 


T={l+y^)log 


m 


cot2  0/2 


[  sin2  (9/2 


+  9  (7,0) 


and  y  =  EJE-^.  In  T  the  angular  dependence  has  been  separated  out  as  g{y,  d) 
which  depends  only  on  the  ratios  of  energy,  from  the  other  term  depending  directly 
on  the  absolute  value  of  E^ .  Screening  can  be  neglected  only  if 

^»  11^  J  [mil  •  ^''^ 

For  zero  angle  scattering,  i.e.  for  radiation  in  the  forward  direction,  the  cross- 
section  is  given  by 

CO  \  4  (  ^f  +  £;|         2^1  E^        {E,  +  Ec,f  ^ 


a>d.„,..=.^..a^.d(^)ij^^log. 


m  CO  2m^ 

with  the  restriction 


(53) 


E. ,  Eo  »  m     and  Ml^  «  1 37  ^- V, .  (54) 


Pair  production 


A  photon  of  energy  greater  than  2  m  can  create  an  electron-positron  pair  when 
passing  through  the  field  of  a  nucleus.  The  presence  of  a  nucleus  is  necessary  to 
conserve  energy  and  momentum. 

There  are  two  ways  in  which  this  process  can  happen. 

(1)  The  incoming  photon  creates  a  pair  and  subsequently  the  electron  interacts 
with  the  field  of  the  nucleus. 

(2)  The  photon  creates  a  pair  and  the  positron  interacts  with  the  field  of  the 
nucleus.  As  in  Bremsstrahlung,  we  assume  that  the  nucleus  is  stationary.  The 
Feynman  diagrams  are  as  shown  below.  Comparing  these  diagrams  with  those 
of  Bremsstrahlung  we  find  that  the  diagrams  look  alike  except  for  the  ordering 
in  time.  For  example  in  the  diagram  (R)  the  particle  of  momentum  p^  is  scattered 
by  Coulomb  potential  and  then  emits  the  photon.  However  when  the  ordering 
in  time  is  taken  into  account,  we  observe  that 


1  P.  V.  C.  Hough,  Phys.  Rev.  74,  81  (1948). 
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(1)  pi  is  a  positron,  i.e.  an  electron  travelling  backward  in  time, 

(2)  The  photon  q  is  absorbed. 

Hence  the  Bremsstrahlung  matrix  elements  can  be  used  for  this  process  if  p^  is 
replaced  by  —  p+  (where  p^  is  the  positron  momentum)  and  q  by  — q_^.  The  density 
of  final  states  here  is  different  since  the  particles  in  the  final  state  are  a  positron 


(R)  (S) 

Fig.  8.  Feynman  diagrams  for  pair  production  by  a  photon  in  Coloumb  field. 


and  an  electron.  It  should  be  noticed  that  the  density  of  final  states  in  this  case 
corresponds  to  a  three -particle  system  as  in  the  case  of  Bremsstrahlung.  Thus  the 
differential  cross -section  is  given  by 


da 


1    /  Z  e2  \2    1 


p+p^  sin0^d0+  sm6_dd_  d(pdE^ 


271  \    Q^  j    137 
^!-sin2  0_.(4^2_^2) 


+ 


J92.sin2(9^.(4^!— ^2) 


+ 


{E_  —  p_  cosd_f         '  {E^  —  p^GosO^)^ 

2p^ p_  sin (9^  sin(9_  cosy  •  (4jg^  E_  +  Q^~2co^) 
{E_  —  p_  cos d_)  {E^  —  p^  cos  d^ ) 


{E_  —  p_  cos  Q)  {E_^  —  p_^  cos  0^) 


(55) 


\\here  p_  and  p+  are  the  momenta  of  the  electron  and  the  positron,  E_  and  E+ 
their  energies,  0_  and  6+  the  angles  which  the  directions  of  the  electron  and  posi- 
tron make  with  the  direction  of  the  photon,  cp  is  the  angle  between  the  electron- 
photon  plane  and  positron-photon  plane  and  Q^  ^  (q — p+  — p^)^.  This  formula 


^  This,  as  well  as  the  fact  mentioned  earlier  that  the  matrix  elements  for  Compton  scattering 
and  the  two-photon  pair  annihilation  are  connected,  are  special  cases  of  what  is  known  as  the 
"substitution  law"  which  is  discussed  in  more  detail  in  Chapter  V.  See  also  J.  M.  Jauch  and 
F.  RoHRLiCH,  The  Theory  of  Photons  and  Electrons,  Addison-Wesley  (1955),  p.  161. 
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has  been  obtained  from  the  corresponding  one  for  Bremsstrahlung  by  the  following 
substitutions : 

p_  for  ^2 J  — V+  fo^  Vi^  ^-  for  E2,  —E+  for  E^,  — 0_  for  02>  — Q+    for  ^1  and 
—  ft>  for  CO. 

Integration  over  the  angles  gives  the  cross-section.  In  the  extreme  relativistic 
case,  where  all  energies  are  large  compared  to  the  rest  energy  of  the  electron, 
the  total  cross-section  becomes 


E'^  +  E'^  -\- ^U  E   El        2E   E  \ 

*  oy  \  mm  J 


(56) 


The  limits  of  validity 

The  use  of  Born  approximation  demands  that 

2  7iZe^  2  7iZe^      , 

<1, <1,  57) 

v^  v_ 

i.e.  the  velocities  of  both  electrons  must  be  large  and  Z  small.  If  v+  ,  v_  <  velocity 
of  light.  Born  approximation  can  no  longer  be  applied  and  the  Sommerfeld  form 

factor  -7-5—- 777^ »   e  ,   is  introduced  and  this  produces  an  asymmetry 

in  the  expression.  The  positron  is  repelled  by  the  nucleus  and  the  electron  attracted. 
The  probability  for  pair  creation  gets  small  for  small  p+  and  large  for  small  p_. 
A  similar  correction  has  to  be  made  in  the  relativistic  case  for  heavy  elements 
(i.e.  large  Z  values). 

(2)  The  charge  distribution  of  the  outer  electrons  produces  a  screening  of  the 
Coulomb  field  of  the  nucleus.  This  effect  is  appreciable  only  at  high  primary  energy. 

2E.E_ 

As  in  the  case  of  Bremsstrahlung,  the  expression  is  valid  for <  137  Z~  '*. 

(3)  When  E+ ,  E_  >  m,  and  for  large  angles,  the  limit  is  given  by 

da  =^d(-^)d^  (— )-^^  m 

\   CO    J  \  CO2  /    4  TT 

Z2  /  e^  \2 
where  A  =  -r^r;^  —     and  S  contains  the  angular  variation.  The  above  is  the 

137  \  m  /  ^ 

probabiHty  per  nucleus  of  pair  production  in  a  solid  angle  dQ  at  an  angle  B(=d^)  and 

in  the  energy  interval  dE_  at  energy  ^_  and  S=  {l-{-y)^  log [    .   '^  '     1  +  /  (r,  6). 

771/    \  Sin  Oj^  I 

It  can  be  seen  that  in  >S,  we  have  separated  the  term  depending  on  E_  directly, 

E 

the  other  term  /  (r,  6)  depending  only  on  E_  \m  and  y  =  -^ .  This  is  valid  only 

E  _ 
TYh 

when  E^,E^^  m  and  0  > . 

E_ 

Momentum  transfer  in  a  production  process  is  large  enough  for  screening  to 
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be  neglected,  provided  0  >  j 


2m  E^     Z^  \V: 
TpE_     l37 


)■'.. 


Hence  the  limitation  on  0  is  of  the 


same  magnitude  of 


com 


137  Z  1/3   which  is  the  boundary  region  where 


screening  becomes  important  in  the  single  differential  cross-section.  Extension 
of  (55)  to  smaller  angles  by  exact  integration  is  valid  only  at  energies  for  which 
screening  is  unimportant  in  the  single  differential  cross -section.  At  high  energies, 
the  small  angle  formula  requires  numerical  integration  as  the  form  factor  for  the 
atom  is  introduced.  But  for  zero  angle  scattering  at  high  energies,  there  is  no 
azimuthal  dependence  of  triple  differential  cross-section.  The  cross-section  (un- 
screened) is  given  by 


\    CO    J  71 


E_\    4:      El 


El  +  El^      E^E 
log 


{E^  +  E_ 


the  limitations  being 


E^,  E_~>m     and 


2w} 


2E^E_ 
com 


com 


2  m' 


«137Z-i/3. 


(59) 


(60) 


Thus  the  average  angle  into  which  the  electrons  are  emitted  is  of  the  order  of 

/yv)    pit 

6  ^ and  for  smaller  energies  the  concentration  in  the  forward  direction  is 

less  marked.  For  large  momentum  changes,  there  is  a  modification  of  the  Coloumb 
fie'ld  resulting  from  the  finite  extent  of  the  nucleus  which  will  give  an  appreciable 
decrease  in  the  cross-section. 

The  use  of  the  decomposed  propagator  helps  us  to  study  the  relative  contri- 
butions of  transitions  through  positive  and  negative  intermediate  states  to  the 
matrix  element  in  electrodynamic  processes  discussed  above.  Calculations  on 
the  Compton  effect  lead^  to  the  apparently  paradoxical  result  that  the  negative 
energy  intermediate  states  give  almost  the  entire  contribution  even  if  the  initial 
electron  is  non-relativistic  while  in  the  extreme  relativistic  case  contributions  arise 
from  transitions  to  both  positive  and  negative  energy  intermediate  states.  It  is 
conventionally  accepted  that  when  the  initial  particles  are  non-relativistic  the 
energy  denominators  become  large  for  negative  energy  states  which  therefore 
do  not  contribute.  This  may  lead  to  erroneous  results  since  as  in  the  Compton 
effect  the  numerators  may  also  become  large  a  circumstance  which  can  be  revealed 
only  by  the  use  of  the  decomposed  propagator.  Since  in  the  case  of  annihilation 
and  creation  of  pairs  the  non-relativistic  limit  is  not  possible,  contributions  are 
made  both  by  positive  and  negative  energy  intermediate  states.  In  Brems- 
strahlung,  in  the  non-relativistic  limit  for  the  outgoing  photon  the  positive  energy 
intermediate  state  contributes  to  a  larger  extent. 


^  Alladi  Ramakrishnan,  T.  K.  Radha  and  R.  Thunga,  Proc.  Ind.  Acad.  Sci. 
228  (1960). 
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\      Energy 

Non-relativistic 
{coi  ~  ft)2  <  m) 

Extreme  relativistic  {coi  ~  ^2  >  m) 
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Intermediate 
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Intermediate 

Energy 
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Positive 
Intermediate  Energy 

AA' 
BB' 

AB' 

\  cos^e 
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0)i                    / 

BA' 

4m2 

TAe  differential  cross-sections  for  Compton-scattering 

Electron-electron  interaction 

The  quantum  electrodynamic  processes  we  have  considered  till  now  relate  to 
the  scattering  of  an  electron  in  an  electromagnetic  field  which  may  be  either  due 
to  a  photon  or  a  potential  of  a  heavy  stationary  nucleus.  We  shall  now  deal  with 
the  interaction  between  two  charged  particles,  in  particular  two  electrons. 

'A 


Fig.  9.  Electron-electron  interaction. 

Even  if  the  electrons  do  not  interact  we  have  to  deal  with  a  system  of  two 

particles  which  obey  the  Pauli  principle.  Hence  we  define  a  kernel  for  a  system  of 

two  free  Fermi  particles  representing  the  amplitude  that  the  particles  at  space-time 

points  1  and  2  move  to  3  and  4.  Obviously  the  kernel  should  be  written  in  the  form 

Kf{4,  3;  2,  I)  =  K%{3,  I)  ^^(4,  2)  -  K%{4,  1)  Z^(3,  2)  (61) 
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where  for  convenience  we  have  labelled  the  electron  at  1  as  'a'  and  that  at  2 
as  'b\  Hereafter  we  shall  consider  only  one  of  the  two  terms  comprising  Kp 
remembering  that  the  total  amplitude  can  always  be  obtained  by  interchanging 
the  proper  space-time  variables,  making  the  corresponding  changes  in  sign  and 
summing  the  amplitudes  so  obtained. 

Assuming  now  an  interaction  defined  by  a  potential  between  the  two  particles, 
we  are  first  tempted  to  write  the  perturbed  kernel  in  the  first  order  approximation  as 

Ki{4,  3;  2,  1)  =  Kf{4,  3;  2,  1)  -  ifRpi^,  3;  5,  6)  F(5,  6)  ^^(6,  5;  2,  1)  d(5)  d(6) 

=  Z^(4,  3;2,  1)-Z^(4,  3;2,  1).  (62) 

If  we  assume  only  a  space  dependent  potential  F(5,  6)  it  should  contain  the  factor 
d{tQ  —  ^5)  (i.e.  the  interaction  is  instantaneous).  In  the  case  of  electrons  we  write 

V{5,  6)  ^-^ T~  ^(^5  —  h)-  However  if  the  potential  associated  with  the  inter- 
ims "  ^e) 
val  (5,  6)  is  not  instantaneous  but  is  required  to  satisfy  the  demands  of  relativity, 

we  must  replace  d  (t^  —  ^g)  by 


^(^5  -  ^6  -  ^5-^6)   +   ^(^5  -  ^6  +   ^5-^6)   =   ^  (^  "  0   +   ^  (^  +   0-  (63) 

This  implies  that  |  ^5  —  ^e  I  i^  ^^^  time  interval  for  the  potential  to  travel  with  the 
velocity  of  light.  But  this  form  of  the  d  function  is  not  quite  correct.  Its  Fourier 
transform  contains  both  positive  and  negative  frequencies.  Since  it  is  natural 
to  identify  any  function  occurring  between  Kp{4:,  3;  6,  5)  and  Kp{Q,  5:2,  1)  as 
the  propagator  of  a  photon  the  transform  of  this  propagator  should  contain  only 

positive  frequencies.  We  choose  instead  of  —  [d{t  —  r)  -{-  d(t  +  r)] 

j[S^  {t-r)  +  d^  {t  +  r)]  =  2  dpit^  -  r^)  =  2  dp{s^)  (64) 

where  ~  i 

(3^,(^2)  =/e-'-«^-^. 

J  71 

0 

The  unpertiu-bed  kernel  can  be  written  as 

Kf{4,  3;  2,  1)  =llKp{4,  3;  6,  5)  Z^(6,  5;  2,  l)d(6)  d(5).  (65) 

From  (62)  we  get  the  first  order  term  of  the  perturbed  kernel  to  be 

Kl.  =  -i  e^fKF{4.,  3;  6,  5)  d^is^)  Kf{Q,  5;  2,  1)  d(5)  d(6).  (66) 

However  even  this  modification  is  insufficient  since  it  implies  that  as  far  as  the 
electrons  are  concerned,  they  are  unaffected  by  the  exchange  of  a  photon,  the  pro- 
pagator of  which  is  given  by  d+  {s^).  To  include  the  effect  on  the  electron  we  first 
note  that  the  unperturbed  kernel  can  be  written  in  the  equivalent  form  (con- 
sidering only  one  of  the  terms  in  (61) ) 

Kf=JKf{4.,  Q)Kf{^,  2)  ^^(3,  5)Kf{5,  l)d(5)d(6).  (67) 
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In  this  picture  we  are  concerned  with  the  amplitude  for  the  electron  'a'  at  1 
going  to  3  through  5  and  (b)  at  (2)  going  to  (4)  through  (6).  In  the  perturbed 
case  this  amplitude  should  be  altered  due  to  the  emission  of  the  photon.  This 
is  achieved  not  merely  by  the  addition  of  dp  (s^)  but  also  by  inserting  the  matrix  y^ 
in  between  the  kernels  of  each  electron 

Zi,(3,  4;  1,  2)  =  -ie^jK^i^,  6)}V^i^(6,  2)^^,(52)^^(3,  5)y^ 

Kf{5,  l)d(5)d(6).  (68) 

We  are  led  to  the  choice  of  y^  on  using  the  following  argument.  Since  in  classical 
electrodynamics  there  is  also  an  interaction  through  the  vector  potential,  the 
complete  interaction  should  be  [1  —  (F5  •  VQ)]dp{slQ)  or  in  the  relativistic  case 

{l-oc^'oc^)  dpislo)  =  Pa  fif,  y^^  yi,^  dp {s^)  (69) 

where  the  suffices  a  and  b  indicate  that  the  quantities  "belong"  to  the  particles 
a  and  b  respectively.  Hence  we  have  for  electrons  obeying  the  Dirac  equation 

Zi,(3,  4;  1,  2)  =  -ie^jK^iS,  5)  iC|,(4,  6)  y,^y,^ 

dp{s^)K^{5,  1)  K^p{Q,  2)  d(5)  d(6)  (70) 

where  y^^  and  y^^  are  the  Dirac  matrices  applying  to  the  spinors  corresponding 
to  the  particles  a  and  b  respectively.  We  can  drop  the  suffices  a  and  b  if  we  ^vrite 
the  kernel  in  the  form  (68). 

The  expression  (70)  for  the  perturbed  kernel  can  be  derived  rigorously  if  we 
assume  that  the  two  electrons  are  perturbed  by  a  photon  field.  The  amplitude 
that  the  electron  'a'  goes  from  1  to  5,  emits  a  photon  of  polarization  e^  with  a 
momentum  fe  at  ^5,  and  then  goes  to  3,  is 

^^(3,  5)e,Y,]/^^e-''"r.e»"'^KH5,  1)[.  (71) 

Similarly  the  amplitude  that  the  electron  'b'  goes  from  2  to  6,  absorbs  a  photon 
of  polarization  e^  and  momentum  feg  ^^  h  ^^^  then  goes  to  4,  is 

Z^(4,6)e,y,|/|^e^'^^-»e^^-^Z^(6,2)|   .  (72) 

If  there  has  to  be  an  interaction  between  the  two  electrons  'a'  and  'b\  then  '6' 
must  absorb  the  same  photon  emitted  by  'a\  This  is  guaranteed  if  the  amplitude 
for  both  these  processes  to  happen  is  taken  as  the  product  of  the  two  amplitudes 
(71)  and  (72)  with  \^  k^,  r}^  [jl.  The  total  amplitude  for  the  exchange  of  any 
photon  between  5  and  6  is  obtained  by  assuming  all  the  polarization  vectors  to 
be  unity  and  summing  over  photon  polarization  and  momentum  and  integrating 
over  t^  and  t^.  Thus  the  amplitude  is 

4jre22;/e-^'"(*'''-*'«>e^^^^^«-^«U-^i^(3,  5)y,KF{^,  \)\ 

a 

1       d^k 

[Kp{4:,Q)y^Kp{Q,2)\-^-^-dt,dt,     for     t,>t,.  (73) 
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When  ^6  <  h  ^^^  amplitude  is  obtained  by  reversing  the  signs  of  the  exponents. 
We  have  only  a  three  dimensional  integration  as  the  fourth  component  k^  of 
the  four  vector,  k  is  related  to  k.  As  in  the  case  of  the  electron  the  summation 
over  all  the  momenta  (with  equal  weight)  gives  the  propagator  dp{s^).  Thus 

dp{s^)  =  4  7rre-fe(^»-"«)e^^^(^-^>     J        -^.  (74) 

J  [Z  jt)        Z  fC^ 

We  can  convert  the  above  into  a  four  dimensional  integral  using  the  lemma 

oo 
-oo 

Thus  6+  [s^)  is  the  inverse  of 

Qik-{Xi-Xe)        ^^  ^  _    f*  Qik{Xs-Xe) 


oo 


/pi  A;  •  (iCj  -  aje)                                          r  Qik{Xi-Xt) 
TV— 7^ r-dn^-47z{-i)   —- ^— d^A;.  (76) 

kl-k^  +  i£  ^       J     k^  +  ie  ■      ' 


•  oo 


Note  that  k^  and  k  do  not  have  the  energy-momentum  relationship  though  they 
form  a  four  vector.  Thus  the  momentum  representation  of  Sj^{s^)  is  ( — i)4:jilk'^. 

It  may  be  intriguing  that  in  this  derivation  we  assumed  that  the  perturbation 
of  the  kernel  acts  through  a  real  photon.  But  it  should  be  remembered  that  the 
summation  over  all  these  photon  functions  yielded  the  kernel  of  the  photon  and 
that  the  final  expression  is  only  a  kernel.  But  in  the  case  of  interaction  between 
two  electrons  we  remind  ourselves  that  we  require  the  amplitude  for  transitions 
between  states  of  real  electrons.  These  amplitudes  are  obtained  by  multiplying 
the  perturbed  kernel  by  wave  functions  and  their  conjugates  corresponding  to 
the  incident  (at  1,  2)  and  emergent  (at  3,  4)  particles  and  integrating  over  the 
points  1, 2, 3, 4,  and  are  non-vanishing  only  if  the  momentum  of  the  photon  is  equal 
to  the  difference  between  the  four  momenta  of  the  electrons  i.e.  q  can  represent 
a  virtual  photon. 

We  shall  make  a  digression  and  discuss  the  propagator  of  a  photon  in  com- 
parison with  that  of  an  electron.  Examining  the  definition  of  Sp{s^)  we  note  that 
we  have  taken  the  energy  of  a  photon  k^  to  be  always  positive.  When  t(=t^  —  t^) 
is  negative,  we  have  replaced  k^  by  — k^  only  in  the  exponent.  But  we  can  also 
replace  the  term  2^:4  in  the  denominator  by  — 2k^  and  attach  a  minus  sign  to  the 
integrand.  Then  we  immediately  recognise  that  when  ^g  <  t^  we  can  interpret 
the  absorption  of  the  positive  energy  photon  at  t^  as  the  emission  of  a  negative 
energy  photon  at  t^  and  in  a  similar  way  the  emission  of  a  positive  energy  photon 
at  ^g  as  the  absorption  of  a  negative  energy  photon  at  Iq  .  Thus  we  can  always 
ascribe  only  emission  of  photons  (both  of  positive  and  negative  energy)  to  the 
vertex  t^  and  absorption  of  photons  at  the  vertex  ^g .  This  implies  the  propagation 
of  a  positive  energy  photon  for  t^  <  ^g  which  is  "forward"  in  time  and  the  propa- 
gation of  a  negative  energy  photon  for  t^  >  ^g  which  is  "backward"  in  time.  Thus 
the  procedure  is  identical  with  that  of  the  electron  except  that  we  do  not  have  a 
term  arising  from  the  spinors  and  the  mass  m  is  zero  in  this  case. 
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In  this  point  of  view  the  minus  sign  occurs  before  the  sum  over  negative  energies 
just  as  in  the  case  of  the  electron  propagator.  We  have  earher  argued  that  this 
minus  sign  is  intimately  related  to  the  Pauli  principle  which  has  to  be  invoked 
in  dealing  with  a  second  order  scattering  of  electron  involving  the  creation  of 
a  virtual  pair.  But  we  know  that  photons  are  bosons  and  the  occurrence  of  minus 
sign  may  seem  perplexing.  The  question  also  arises  why  the  postulate  of  a  negative 
energy  sea  has  not  to  be  invoked  even  in  the  boson  case  to  explain  the  negative  sign. 
Perhaps  the  answer  may  be  in  that  we  always  assume  an  interaction  which  is 
bilinear  in  the  fermion  field  variables  while  linear  in  the  boson  field  variables.  The 
bilinear  nature  implies  that  the  destruction  of  a  negative  energy  particle  which 
is  equivalent  to  the  creation  of  a  positive  energy  antiparticle  should  be  associated 
with  the  creation  of  a  positive  energy  particle.  Thus  the  propagator  of  a  negative  energy 
electron  from  A  to  B  can  be  associated  with  the  destruction  of  a  real  electron 
at  A  and  creation  of  a  real  electron  at  B  and  the  negative  sign  plays  a  role  in 
the  statistics  of  these  electrons.  A  similar  circumstance  does  not  arise  in  the  case 
of  the  photon  because  of  the  linearity  of  the  interaction  in  the  photon  field. 

We  shall  now  obtain  the  matrix  element  for  electron-electron  scattering.  The 
two  diagrams  corresponding  to  this  process  are 


q"P3~PrP2-P4  q  =  P2-P3=P4-P| 

(R)  (S) 

Fig.  10.  Feynman  diagrams  for  the  electron-electron  scattering. 


The  amplitude  for  the  process  is  given  by 

<='i;r[^F(3,5)y^Xp(5,  l)]„-i^[i:j.(4,6)y^irp(6,2)]j-^,d(5)d(6).      (77) 

The  states  at  1,  2,  3  and  4  are  free  particle  states  represented  by  plane  waves. 
In  momentum  representation  the  spinor  parts  of  the  brackets  are  Usy^tUj^  and 


I 
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UiYfiU^.  Integration  over  (5)  and  (6)  gives  conservation  of  four-momenta.  Thus 
the  matrix  element  M  is  given  by 


ilf  =  4  71  e^ 


(78) 


(Pa-Pir  (P4-P1)' 

The  first  and  second  terms  come  from  R  and  S  respectively.  The  summation 
over  ju  is  implied.  We  shall  now  show  that  this  is  exactty  equivalent  to  an  instant- 
aneous Coulomb  interaction  and  exchange  of  transverse  photons.  We  choose  one 
of  the  space  directions  parallel  to  the  photon  momentum  q  and  the  other  two 
transverse  to  it.  We  can  now  split  the  matrix  element  as 

^^^2  =  (^4 rt ^2) -^ (^3  Vt ^1)  -  {u^ yq ^2) -^ («*3 yq ^1) 

-  Z  («*4  ytf  ^2)  -^  («*3  ytr  %)  •        (79) 

l,2tr  q 

We  note  that 

Ui(]U2  =   2^4 (P2  —  P4)^2  ^  ^(^2^4  —  U^U^)  =-  0.  (80) 

Substituting 

yQ  =  q.ytlQ  (81) 

M  1     .        „   _  „  1 


47re2     ^  (^4n^2)-2"(l  -qllQ^)iU3yt^l)-    i:  {UiytrU2)^{U3ytTK 


l,2tr 


-  -  (?^4  n  -^2)  7^  (%  yt  %)  -  i;  («^4  ytr  ^2)  -^  («*3  nr  %)  •  (82) 

V  i.2«r  q 

The  Coloumb  potential  which  has  only  spatial  dependence  ^^  {\r\){A=^0;q)  =  Ze^jr) 
is  the  time-like  component  of  the  four  vector  A.  Thus 

A(r)  =  y,Ze2/r.  (83) 

Taking  the  Fourier  transform  we  have 

a(^)  =-^k{r)  e-^'^-d'q  =  yt4.7iZ  e^d{q,)  i-  (84) 

d{q^)  occurs  due  to  the  time  independence  of  A.^ 


4.  SELF-ENERGY  AND  RENO  RMALIZ  ATION 

Once  we  have  envisaged  intermediate  states  where  energy  and  momentum 
need  not  be  simultaneously  conserved,  it  is  clear  that  we  have  to  take  into  con- 
sideration processes  of  the  following  type :  A  free  electron  emits  a  virtual  photon 

^  It  is  interesting  to  consider  the  problem  when  the  masses  of  the  charged  particles  are 
taken  as  different.  In  such  a  case  the  Pauli  principle  need  not  be  invoked  and  in  the  limit 
when  one  of  the  masses  becomes  infinite  it  can  be  shown  that  the  interaction  reduces  to  that 
of  a  pure  potential.  Recoil  corrections  can  also  be  studied  as  in  the  case  of  Bremsstrahlung. 
See  Alladi  Ramakrishnan,  S.  Indumathi  and  G.  Bhamathi,  Proc.  Ind.  Acad.  Sci.,  A  53, 
206  (1961). 
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which  it  subsequently  reabsorbs  (this  is  usually  termed  interaction  with  its  own 
radiation  field)  or  a  photon  creates  virtual  pairs  which  subsequently  annihilate 
themselves  creating  the  photon.  These  are  called  self-energy  processes  and  we 
shall  consider  them  below  in  detail. 

A  free  electron  even  in  the  absence  of  an  external  electromagnetic  field  can 
emit  a  virtual  photon  and  reabsorb  it.  The  free  electron  kernel  we  till  now  con- 
sidered is  a  two-point  function  which  does  not  include  this  process.  On  the  other 
hand,  the  kernel  representing  the  amplitude  of  going  from  one  space  time  point 
to  another  when  the  emission  and  absorption  of  photons  in  between  in  the  Feynman 
sense  is  taken  into  account  is  the  propagator  perturbed  by  the  self- field  and  indeed 
should  be  considered  the  propagator  of  a  physical  particle  in  the  absence  of  an 
external  field.  The  perturbed  propagator  for  the  photon  is  obtained  by  the 
inclusion  of  the  process  of  creation  of  a  pair  by  the  photon  and  the  annihilation 
of  that  pair  with  the  creation  of  the  photon.  In  a  similar  manner,  the  vertex 
operator  has  to  be  modified  by  the  emission  and  reabsorption  of  a  photon  across 
the  vertex. 

To  obtain  the  kernels  and  the  vertex  operator  of  the  physical  particle,  we  start 
with  the  kernel  of  the  bare  particle  defined  by  its  mass  and  wave  function  and  use 
the  standard  techniques  of  Feynman  theory.  Since  any  free  particle  kernel  is 
defined  through  its  mass  and  wave  function,  the  modification  in  the  kernel  can 
be  interpreted  as  changes  in  mass  and  wave  function.  The  mass  of  the  physical 
particle  is  identified  with  the  mass  we  observe  in  the  real  world.  This  principle 
of  identifying  the  attributes  of  the  physical  particle  with  the  observable  attributes 
is  known  as  the  principle  of  renormalization.  The  importance  of  recognizing  this 
is  essentially  due  to  the  fact  that  if  the  physical  mass  is  to  be  treated  as  the  bare 
mass  plus  a  correction  arising  from  the  self  interaction,  the  correction  turns 
out  to  be  an  infinite  quantity.  The  principle  of  renormalization  asserts  that  it 
is  after  the  addition  of  this  infinite  quantity  that  we  obtain  the  finite  observed 
mass  of  the  physical  particle.  Since  the  self-energy  process  involves  an  emission 
and  an  absorption  of  a  photon  and  therefore  contains  a  factor  e^  along  with  the 
wave  function,  instead  of  treating  the  wave  function  as  modified  due  to  these 
processes  we  can  take  the  charge  to  be  modified.  Likewise,  the  modifications 
in  the  vertex  operator  and  photon  propagator  can  also  be  interpreted  as  suitable 
modifications  in  the  charge  and  the  principle  of  renormalization  applied  to  the 
charge. 

It  may  seem  at  first  sight  that  the  principle  of  renormalization  is  merely  a 
justification  for  taking  the  observed  values  for  the  mass  and  charge.  However 
it  will  be  shown  that  it  leads  to  observable  effects  which  are  not  predicted  if 
these  self-energy  processes  are  not  taken  into  account.  This  is  because  even  after 
the  renormalizations,  there  exist  finite  corrections  depending  on  the  nature  of 
the  process  considered.  The  computation  of  these  corrections  can  be  carried 
out  by  the  regularization  of  the  infinite  integrals  which  means  expressing  the 
infinite  integrals  as  the  limits  of  integrals  which  can  be  evaluated  and  which 
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involve  an  auxiliary  parameter  with  respect  to  which  the  limiting  process  is 
defined^. 

We  shall  now  consider  these  radiative  processes  in  detail  2. 

Self-ener^  of  the  electron 

We  shall  consider  the  perturbed  kernel  Kp  (2, 1)  to  the  lowest  order  of  a  particle 
going  from  1  to  2  emitting  a  photon  at  3  and  absorbing  it  at  4.  We  have 


P-k 


Fig.  11.  Feynman  diagram  for  the  self  energy  of  an  electron. 

or  in  momentum  representation, 

S'Av)  =  Sf  if)  +  ^F  (V)  2  iP)  Sf  iP) 
Sp{p)^ll{p-m) 


(2) 


where 


and 


2(p) 


■{''"J^ 


m 


y^ 


(47.3) 

J^  (p)  can  be  written  in  the  form 

2J{p)  =  A  +  {p-m)B  +  {p-  mfZciv) 


di^k. 


(3) 


1  W.  Pauli  and  F.  Villars,  Rev.  Mod.  Phys.  21,  434  (1949). 

2  In  this  section  we  follow  closely  the  treatment  of  J.  M.  Jauch  and  F.  Rohrlich,  The 
Theory  of  Photons  and  Electrons,  Addison-Wesley,  Cambridge,  Mass.  (1955).  See  also  F.  J. 
Dyson,  Phys.  Rev.  75,  486  and  1736  (1949);  R.  P.  Feynman,  Phys.  Rev.  76,  769  (1949); 
R.  P.  Feynman,  Lectures  delivered  at  the  California  Institute  of  Technology  on  Quantum 
Electrodynamics  (1953). 
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A  = 


271 


m 


B  = 


47r 


—  f 

in^  J 


d*k 


71^  J' 


{k^  —  m 


2\2    +    4 


and 


ZAP) 


1 


(P  — m2)2 
.       2-3^ 


r  dx 

J        X 


+ 


11 


27tm  [2(1  —  ^) 
p  +  m  r         1 


log^j 


with 


2m 

l2e(i 

-e) 

0 

-X 
} . 

(P'- 

m^) 

(_J,,i=5.±i,,,) 


(4) 


m" 


The  integrals  ^  and  B  are  logarithmically  divergent  and  this  presented  a  major 
obstacle  to  quantum  electrodynamics  for  many  years  till  the  renormalization 
procedure  was  adopted.  To  regularize  the  integrals  Feynman  introduced  a  cut-off 

1  .  1  1 


factor  A  such  that  the  photon  propagator  instead  of  being  -rg  is  taken  as  ^^ 
where  A  ->  oo.  In  such  a  case  A  and  B  are  obtained  as 


k^-}} 


2n 


m 


log 


m 


B 


4:71 


J. 

21og 1—4  / h  — 

\       ^  m         /         J     X    ^   2 


To  interpret  the  term  A,  we  consider  the  expansion 

1  1.1 


Sf  {p,  m-\-  dm)  = 


-f 


dm 


m 


+ 


(6) 


(7) 


p  —  m        p  — m  p 

Comparing  this  expansion  with  that  of  2j  (p)  we  identify  A  to  he  dm  which  can 
be  considered  as  the  correction  to  the  bare  mass  m.  Since  A  is  infinite,  we  invoke 
the  principle  of  renormalization  and  identify  m  +  dm  to  be  Wq,  the  observed 
mass  which  is  finite.  The  perturbed  propagator  Sp{p,  m)  is  now  expressed  in 
terms  of  the  physically  significant  m^  as 

8'p{p,  m)  =  8p{p,  mo)  +  Sp{p,  mQ)[{A  —  (5 m)  +  (p  —  mo)  B 

+  (p-mo)2  2;,(^)]^^(p,mo)i.  (8) 

The  above  prescription  for  writing  8p  {p,  m)  can  be  stated  in  the  following  manner. 

1  In  writing  this,  we  must  remember  that  dm  occurs  with  a  factor  e^  and  terms  of  the 
order  e*  and  higher  are  neglected. 
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A  theory  using  the  bare  ynass  and  including  self  action  is  equivalent  to  a  theory 
using  the  observed  mass  plus  self  actioyi  minus  dm  computed  as  a  correction  to  the 
bare  mass  m  of  a  free  particle. 

The  remaining  terms  in  the  expansion  of  Sp[p,  m)  can  be  written  as  (1  +  B) 
[Sp{p,  m^)  +2]c{P>  ^o)]  *o  order  e^.  Since  the  propagator  Sp  {p)  is  the  transform  of 
the  kernel  ^y;„  (2) y)^  (1 ),  aS^  {p,  m)  can  be  considered  as  the  corresponding  transform 

n 

of  a  kernel  with  a  re -defined  wave  function  (in  momentum  space) 

t/;;  =  (l  +  5)Va^^.  (9) 

Since  these  wave  functions  occur  in  association  with  the  charge  e  in  any  pertur- 
bation expansion,  we  can  retain  the  form  of  \p  by  attaching  the  factor  (1  +  J5)i/^ 
to  the  unrenormalized  charge  e.  This  may  be  shown  in  the  following  way.  Consider 
an  71  th  order  diagram  in  any  quantum  electrodynamic  process  containing  Sg. 
external  electron  lines  and  S^  internal  electron  lines.  There  are  two  electron  lines 
coming  into  each  corner  or  the  number  of  electron  line  endings  i^  2n.  But  each 
internal  electron  line  has  two  endings  and  each  external  electron  line  one  ending. 
Thus  ^ 

2n  =  2Si-\-S,     or     S^^n ^.  (10) 

Li 

The  second  order  radiative  corrections  are  taken  into  account  by  replacing 
every  S^[p)  by  (1  +  B)\8:p{p)  ^-^c{V)\  every  \p  by  (1  +  B^l^ip  and  every  jp  by 

(1  +  B)^l^xp  in  the  original  diagram.  There  are  (  n ~  |  Sp  (^)'s,  -^  tp's  and  -—  ^'s 

originally.  Thus  the  whole  matrix  element  is  modified  by  finite  terms  only  and 
the  infinite  terms  can  be  grouped  together  into  a  multiplicative  factor  of  the  form 

(1  +  J5)    2  (i_|-^)2  =(l  +  5)^  (11) 

Jince  there  are  exactly  n  e's  occuring  in  the  matrix  element  corresponding  to  the 
corners,  we  can  eliminate  (1  +  B)^  by  defining  the  renormalized  charge  as 

eo^(l  +  i^)e.  (12) 

Replacement  of  every  e  by —  results  in  the  cancellation  of  the  infinite  factor. 

We  are  finally  left  with  a  finite  matrix  element  which  takes  into  account  the  second 
order  radiative  corrections.  However,  to  complete  the  renormalization  of  charge 
to  this  order,  we  have  to  take  into  account  other  processes  which  will  be  considered 
p^esentl3^ 

Jelf-energy  of  the  photon  ^ 

In  discussing  the  photon  propagator,  we  shall  consider  the  creation  of  an  electron 
positron  pair  by  the  photon  and  the  subsequent  annihilation  of  the  pair  with  the 
creation  of  the  photon.  The  photon  propagator  which  includes  the  creation  at> 

1  J.  M.  Jauch  and  F.  Rohrlich,  loc.  cit.;  J.  Schwinger,  Phys.  Rev.  74,  1439  (1948); 
G.  Wentzel,  Phys.  Rev.  74,  1070  (1948). 
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vertex  1  and  the  annihilation  at  vertex  2  is  the  perturbed  photon  propagator  to 
the  lowest  order  and  is  given  by^ 

Df^^M  =  ^^v^Fik)  +  Dp{k)  n^,{h)  Dp{k)  (12a) 

1 


where 


Dp{k) 


and 


^^.W  =  -4^Spur  fy^ — ^- — y, i- d*^.  (13) 

^  47r  J  ^  D  —  m       D  —  k  —  m 


Fig.  12.  Feynman  diagram  for  the  self  energy  of  a  photon. 

"The  integral  is  quadratically  divergent.  However,  we  can  reduce  the  degree  of 
divergence  by  invoking  the  principle  of  gauge  invariance.  The  photon  propagator 
Dp  {k)  is  bilinear  in  a^,^  so  that  if  we  make  the  gauge  transformation 

a,.{k)^a,{k)  +  k^Xik)  (14) 

in 

D^(k)n^,{k)D^(k) 

we  get  new  terms  of  the  form 

1  EarUer  in  this  chapter  we  denoted  the  photon  propagator  by  Dp.  In  a  strict  sense  it 
should  be  written  as  3f^yDp{k),  the  coefficient  (5^^  implying  that  fx  =  v  when  we  introduce  y^, 
and  y^  at  the  vertices  representing  emission  and  absorption.  In  modifying  this  propagator 
by  virtual  processes  of  annihilation  and  creation  of  pairs,  we  are  constrained  to  introduce 
two  indices  ju  and  v  since  we  are  treating  quite  generally  the  y^  interactions  at  the  vertices. 
It  will  be  proved  that  ITf^^ik)  will  be  of  the  form  dj^j,n{k^). 

^  af^{k)  represents  the  amplitude  for  a  photon  to  have  a  momentum  k  and  polarization  //, 
i.e.  it  is  the  momentum  transform  of  ^^(x).  In  studying  the  electron-electron  interaction 
through  the  exchange  of  a  photon  at  the  two  vertices,  we  had  set  the  magnitude  of  a^(fc) 
equal  to  unity  apart  from  a  factor  of  y47r/2  co  and  that  is  why  only  y^^  occurred  at  the  vertices. 
However  since  we  wish  to  introduce  transformations  we  have  to  use  general  a^^  's  along  with 
the  subsidiary  condition. 
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We  therefore  require 

K^f*v{k)  =  0         n^^{k)k^  =  0  (15) 

to  maintain  gauge  in  variance. 

Since  77^,  ^  (k)  is  a  second  rank  tensor  and  depends  only  on  A;^ ,  it  can  be  written 
in  the  form 

n,M  -  G{k^)k^K  +  D{k^)d,,.  (16) 

Therefore  from  (15)  we  get 

k^Cik'')  +  i)(P)  -  0  ■  (17) 

or 

n,M=(k,A-k%^)C(B)  (18) 

or 

Z^^.n^,{k)^n';,{k)  =  -^k^c{k^)  (i9) 

""""^  i72(0)  =  0.  (20) 

However  n^(0)  when  evaluated  from  (13)  turns  out  to  be  quadraticaUy  divergent. 
Hence  we  subtract  the  term  out  from  (13)  in  order  to  preserve  gauge  in  variance. 
The  resultant  integral  can  be  evaluated  and  is  only  logarithmically  divergent. 

It  may  be  noted  however  ^  that  by  treating  the  photon  as  a  neutral  vector 
meson  of  mass  x,  the  term  11^(0)  can  be  identified  as  a  mass  term  and  eliminated 
by  mass  renormaHzation.  Finally,  the  limiting  process  a,8  k-^0  can  be  adopted 
to  obtain  a  gauge  invariant  expression  identical  with  the  one  got  by  our  procedure. 

To  return  to  our  discussion,  C{k^)  can  be  written  as 

C(F)=  C(0)-Fi7e(P)  (21) 

where  (7(0)  is  logarithmically  divergent  while  77^  (k^)  is  convergent.  C(0)  and  17^  {k^) 
are  given  by 

and 

Bn,(V)^—      J^+...  (23) 

for  small  k^jAm^. 

If  now  IJuv  (k)  is  inserted  into  an  external  photon  line,  the  k^^k^  term  cannot 
contribute  since  A;^a^,  ^  0  for  a  free  photon.  Suppose  however  that  the  insertion 
is  into  an  internal  photon  line.  Then  corresponding  to  each  such  diagram  we  can 
obtain  two  new  diagrams  by  omitting  the  self-energy  part  as  shown  in  Fig.  13. 
Each  of  these  new  diagrams  gives  terms  of  the  form  Ay^^a^^B  where  A  and  B 
are  contributions  from  the  rest  of  the  diagram.  In  variance  under  the  gauge 
transformation  a^  -^  a^^  +  k^^x  (^)  requires 

i;Ay,k^B^.O.  (24) 

1  P.T.Matthews,  Phys.  Rev.  76,  1657  (1949);  F.  Coester,  P%5.  Rev.  83,   798  (1951) 
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The  summation  is  over  all  the  different  diagrams  corresponding  to  the  different 
places  of  insertion  of  ^I^^^  {k). 


■J'j 


AAAA^ 


-AAAA 


!FiG,  13.  Inclusion  and  omission  of  the  self  energy  part  in  an  internal  photon  line. 

Now  the  original  set  of  diagrams  furnishes  an  expression  of  the  form 

(where  C  and  D  are 


SAv,B^n^,(k)~Cy,.D 


functions  similar  to  A  and  B) 


2AkB^Cr)^CkD 


k^ 


B 


(25) 


The  first  term  on  the  right-hand  side  has  been  shown  to  vanish.  Thus  TTft^ik) 
reduces  to 

n^.{k)=-k^C{0)  d^,  +  k^Hcik^)  d^,  (26) 

Thus  the  modified  photon  propagator  is  given  by 

D'p{k)=={l-  C)  Df, {k)     to  order  e^  (27) 

where  D'p{k)  d^,  =  D'^^,{k) 

DF^k)  =  Dpik)  +  n,{k^) 

and  Dp^(k)  is  the  renormalized  photon  propagator.  We  also  obtain  the  modified 
wave  function  for  free  photons  as 

a;W=(l-C)i/XW-  (28) 

As  we  have  mentioned  earlier,  the  modifying  factor  (1  —  CY^^  can  be  carried 
over  to  the  charge  since  with  each  vertex  a  factor  e  is  associated  and  the  number 
of  vertices  is  equal  to  the  number  of  times  the  factor  (1  —  C)^^^  occurs  in  any 
diagram.  This  can  be  shown  by  the  following  simple  argument. 
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Consider  an  n-th.  order  diagram  having  Pg  external  and  P^  internal  photon 

lines.  Since  one  photon  line  ends  at  each  corner,  the  number  of  photon  line  endings 

will  be  equal  to  the  number  of  corners.  Each  P^  has  two  endings  while  each  P^ 

n-Pe 
one  ending.  Thus  2P^  -r  Pg  =  w  or  P^  =^ — 


The  modified  matrix  element  is 


got  by  replacing  Dp{k)  by  Dp{k),  %{k)  by  al{k)  and  aj(fc)  by  aJ(A;)  in  the  ori- 
ginal diagram.  Thus  the  infinite  constants  can  be  grouped  together  as 


n-P« 


(1-c)  2  (i-c)2  =(i-cr/2 


(29) 


while  the  rest  of  the  matrix  element  is  modified  by  finite  terms  only.  The  infinite 
factor  can  as  before  be  removed  by  the  renormalization  of  charge  by  defining 
Cq  by  eQ=  (1  —  Cy^H,  e^  being  the  renormalized  charge.  The  final  result  thus 
contains  only  finite  and  observable  quantities. 

We  have  seen  above  how  the  photon  propagator  has  to  be  modified  if  we  in- 
clude virtual  annihilation  and  creation  of  pairs.  Inclusion  of  this  possibility 
implies  that  we  can  always  consider  even  in  a  vacuum  such  processes  to  occur 
and  the  amplitude  c  for  a  vacuum  remaining  a  vacuum  is  obtained  by  con- 
sidering such  diagrams  as  Fig.  14. 


Fig.  14.  Feynman  diagrams  representing  vacuum  processes. 


Since  in  the  absence  of  an  external  field  the  probability  |cp  for  a  vacuum 
remaining  a  vacuum  is  unity,  |  c  |^  =  e*'*  where  (x  is  real.  In  any  process  involving 
real  particles,  we  have  to  include  these  vacuum  diagrams  from  which  no 
external  lines  emerge.  Each  of  the  vacuum  diagrams  is  divergent  and  this  implies 
that  the  phase  factor  oc  is  infinite.  However  this  is  going  to  be  of  no  consequence 
since  the  only  effect  of  such  an  inclusion  is  to  multiply  the  real  process  by  a  numerical 
phase  factor  which  is  just  the  amplitude  for  a  vacuum  remaining  a  vacuum. 
The  modification  of  all  real  processes  by  a  common  phase  factor  can  have  no 
observable  effect  and  hence  we  need  not  consider  these  diagrams  at  all.  In  fact 
we  shall  see  in  the  definition  of  the  propagator  in  field  theory  that  this  factor  is 
cancelled  by  dividing  any  amplitude  by  the  amplitude  of  a  vacuum  remaining 


a  vacuum. 
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An  external  electromagnetic  field  will  be  modified  by  the  inclusion  of  the 
processes  of  virtual  pair  creation.  This  is  known  as  the  vacuum  polarization 
effect  due  to  the  external  field,  the  word  "polarization"  being  used  evidently  to 
denote  that  when  we  have  an  external  field  it  also  implies  a  polarization  of  the 
medium  due  to  the  creation  of  virtual  pairs 

The  vertex  part^ 

The  diagram  corresponding  to  the  second  order  radiative  correction  to  the 
vertex  operator  y^  is  shown  in  Fig.  15. 


^J\f\f\f\f\f\A 


P-k 


Fig.  15.  Second  order  radiative  correction  to  the  vertex  operator. 

The  inclusion  of  this  correction  replaces  the  vertex  operator  y^  by 

r,,  =  y,  +  A,Av',v)  (30) 


where 


A,{V',  V)  = 


ie^    r 


7a 


-y^' 


r^ 


(31) 


p'— k  — m    '^'^    p— k— m 

The  integral  is  logarithmically  divergent  and  can  be  treated  in  a  way  similar  to 
2{p)'  The  final  result  may  be  written  as 

A^{p\  p)  =  Ly^  +  A^,{p',  p).  (32) 

^ncip'^P)  is  convergent  and  is  uniquely  defined  by  the  condition  A^c{p',  p)  =  0 
for  p  =  p'  =  m  (i.e.  they  correspond  to  free  particles  and  are  equal  in  magni- 
tude). X  is  a  logarithmically  divergent  integral  and  is  given  by 

1 


L  = 


4:71 


,i7r  J 


d^k 


(F  _  ^2)2 


/" 


da:        11 


(33) 


1  J.  M.  Jatjch  and  F.  Rohrlich,  loc.  cit;  R.  P.  Feynman,  Phys.  Rev.  76,  769  (1949). 
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which  is  recognized  to  be  equal  to  — B.  This  result  is  a  special  case  of  a  general 
identity  due  to  Ward^. 

The  general  form  oi  A^^('p' 'p)  is  quite  complicated.  However  when  p'  and  p 
correspond  to  free  particles,  the  regularized  expression  for  it  may  be  shown  to  be 


A,e(?'»  = 


sin  20 


(34) 


where  Amm  is  a  small  mass  attributed  to  the  photon  as  an  infra-red  cut-off  by 
replacing  k"^  by  M^  —  X^^^  in  the  denominator  of  the  integrand.  This  is  done  in 
order  to  remove  the  singularity  of  the  integrand  at  A;^  =  0.  d  is  given  by  the 
relation  4:m^s,w?d  =^  q^.  We  may  thus  write  the  modified  vertex  operator  as 

r^ip'p)={l  +  L){Y,.  +  ^,c(p'p))  (35) 

to  order  e^.  Since  exactly  one  e  occurs  at  each  corner,  we  can  remove  the  infinite 
factor  (1  +  L)  by  defining  the  renormalized  charge  Cq  by  Bq  =^  (1  +  L)e. 

We  thus  see  that  if  we  take  into  account  all  the  three  charge  renormalizations, 
the  final  Cq  will  be 

eo  =  (1  +  B){1  +  L)il-  C)^l^e  =  (1  -  1/2  C)e  (36) 

to  order  e^  since  B  =  — L. 

We  shall  now  give  two  examples  where  the  corrections  due  to  the  above 
processes  have  been  worked  out,  viz.  the  radiative  corrections  to  the  magnetic 
moment  of  the  electron  (which  gives  rise  to  its  anomalous  magnetic  moment) 
and  the  Lamb  shift. 

The  anomalous  magnetic  moment  of  the  electron  ^ 

The  basic  process  in  which  the  magnetic  moment  term  makes  its  appearance 
is  the  vertex  diagram  without  radiative  corrections.  Figures  16  and  17  give  the 
second  order  radiative  corrections  to  this  diagram.  The  diagrams  (a)  and  (a') 
contribute  only  to  the  mass  and  wave  function  renormalizations  since  the 
radiative  corrections  occur  only  in  the  external  lines.  The  corresponding 
charge  renormalization  is  spurious  cancelling  with  that  arising  from(b).  (c)  gives 
a  true  charge  renormalization  and  a  vacuum  polarization  effect,  but  since  this 
is  something  that  modifies  the  external  potential  and  not  the  properties  of 
the  electron  it  cannot  contribute  to  the  magnetic  moment,  (b)  makes  a  finite 


1  J.  C.  Ward,  Phys.  Rev.  78,  182  (1950). 

2  J.  ScHWiNGER,  Phys.  Rev.  73,  416  (1948)  and  76,  790  (1949);  R.  Karplus  and  N.  M. 
Kroll,  Phys.  Rev.  77,  536  (1950). 
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contribution  to  the  magnetic  moment  of  the  electron.  The  total  contribution  to 
the  matrix  element  from  the  first  three  diagrams  can  be  shown  to  be  (cf.  Eq.  (34)) 


(37) 


(a)  (a') 

Fig.  16.  Electron  self-energy  corrections. 


(a)  (b) 

Fig.  17.  (a)  Radiative  correction  to  the  vertex; 
(b)  Vacuum  -polarization  diagram. 

when  q-~=  ^2  —  Vi  i^  small.  The  above  M  will  thus  modify  the  Coulomb  scattering 
and  we  will  now  show  that  the  first  term  in  M  will  give  rise  to  an  additional 
magnetic  moment.  We  have 

a  q  —  q  a  =  y^,y^  {a^,q^  —  q^,a^)  (38) 

which  becomes  in  coordinate  representation 

Y,7.(KA,.  -  v,A.)  -  -y.y.-f-v  (39) 

where  F^^  is  the  electromagnetic  field  tensor.  But  the  effect  of  a  particle  having 
a  magnetic  moment  //  is  to  subtract  a  potential  juy^y^Fi^^  from  the  ordinary 
potential  A  =  y^-^in  appearing  in  the  Dirac  equation.  This  is  precisely  what  the 
first  term  of  M  does,  so  that  it  is  interpreted  as  a  correction  to  the  magnetic 

moment.  It  follows  that 

e     /  e^  \ 

//electron  =  ^ 1  +  -^ —    •  (40) 
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Karplus  and  KroU  have  calculated  the  corrections  to  the  magnetic  moment 
to  the  fourth  order  in  e.  Their  final  result  is 

/^electron=l  •0011454-^  (41) 

^  lit 

a  value  which  has  been  experimentally  verified. 

The  Lamb  shift  ^ 

According  to  the  Dirac  theory,  the  ^"^S-^i^  and  ^^P-^^j^  levels  of  hydrogen  should 
be  coincident.  However  when  the  radiative  corrections  are  taken  into  account, 
the  levels  get  separated  by  1057.19  ±  0.2  Mc/sec.  Precise  experiments  have  in  fact 
confirmed  this  level  shift  and  the  agreement  between  theory  and  experiment  is  at 
present  up  to  +0.22  ±  0.23  Mc/sec.  Thus  this  provides  one  of  the  most  accurate 
confirmations  of  the  correctness  of  the  renormalization  techniques  developed  in 
quantum  electrodynamics. 

In  order  to  proceed  with  the  calculations  in  this  case,  we  must  first  derive  an 
expression  for  the  bound  state  electron  propagator.  For  this,  we  note  that  if  A^^{x) 
is  the  external  potential  due  to  the  nucleus,  the  bound  state  wave  function 
satisfies  the  equation 

(^V  —  eA^)^  —  mip=^-0.  (42) 

We  define  the  Fourier  transform  of  A^  (x)  through 

A%(x)=^'j^jA%{q)6{q,)e'^--d^q.  (43) 

Then,  if  the  electron  is  scattered  from  the  state  p  to  the  state  p'  due  to  this 
potential,  the  electron  propagator  will  satisfy  the  equation 

(p'  -  m)  SUp'.  V)  =  J^Y,  f^iiP'  ~  P")  SW.  P)  d'p"  +  i  »(P'  "  P)  ■     (44) 

[Since  we  are  considering  Coulomb  scattering  the  fourth  components  of  p'  and  p 
|jare  the  same  here.  The  integral  equation  implies  that  we  are  considering /8'|T(y,  p) 
'as  the  solution  of  equation  (44)  when  ^^,(a:)  is  not  treated  as  a  perturbation, 
or  equivalently,  the  solution  is  taken  to  all  orders.  However,  the  radiation 
field  is  treated  as  a  perturbation. 

The  above  equation  may  be  symbolically  written  as 

1  W.  E.  Lamb,  Jr.  and  R.  E.  Retherford,  Phys.  Rev.  72,  241  (1947);  S.  Triebwasser. 
E.  S.  Dayhoff  and  W.  E.  Lamb,  Jr.,  Phys.  Rev.  89,  98  and  106  (1953);  H.  A.  Bethe,  Phys, 
Rev.  72,  339  (1947);  N.  M.  Kroll  and  W.  E.  Lamb,  Phys.  Rev.  75,  388  (1949);  J.  B.  French 
and  V.  F.  Weisskopf,  Phys.  Rev.  75,  1240  (1949);  R.  P.  Feynman,  Phys.  Rev.  74,  1430 
(1948);  M.  Baranger,  H.  A.  Bethe  and  R.  P.  Feynman,  Phys.  Rev.  92,  482  (1953);  R.  Kar- 
plus, A.  Klein  and  J.  Schwinger,  Phys.  Rev.  86,  288  (1952) ;  H.  M.  Fried  and  D.  R.  Yennie, 
Phys.  Rev.  112,  1391  (1958).  See  also  J.  C.  Gunn,  Rep.  Progr.  Phys.  18,  127  (1955)  whose 
treatment  we  follow  here. 
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The  main  contributions  if  radiative  corrections  are  included  arise  from  two  types 
of  graphs.  One  is  the  self-energy  part.  Here  it  is  the  correction  due  to  the  emission 
and  absorption  of  a  photon  to  a  bound  state  propagator,  represented  by  a  double 
line  in  the  diagram.  The  second  is  the  vacuum  polarization  graph.  These  are 
illustrated  to  order  e^  below: 


^°)  (b) 

Fig.  18.  Feynman  diagrams  for  Lamb  shift. 

(a)  Self -energy  diagram  for  the  bound  electron, 
(b)  Vacuum  polarization  diagram. 

The  contribution  due  to  the  first  type  of  graph  wiU  be 

xy^(pa{p)d^Jcd^p'd^p.  (46) 

Here  <pa  's  are  the  Coulomb  wave  functions  of  the  state  under  consideration  and 
may  be  written  as 

q^ai^)  =  V.{x)  6-^^"^         cp^ip)  =  -^^jn(x)  e"'^*" d^o;.  (47) 

If  the  total  contribution  due  to  this  diagram  is  denoted  by  M^'^\  then,  since  the 
interaction  Hamiltonian  makes  a  contribution  of  the  form  —i{a\Hi\a)  to  the 
second  order  matrix  element,  the  level  shift  A  E^  of  the  state  a  is  given  by 

A  Ea='ReiM(^)  =  6^14:71^ 
X  ///-^  9a{p')  7, SUp'~  k,p-k,Ea-  K)  y.niP)  d*^ d'p  d^p'.      (48) 

It  can  be  shown  that  here  also  renormalization  can  be  carried  out  using  the  free 
particle  counter- terms.  One  can  thus  obtain  a  final  convergent  result. 

By  a  similar  procedure  the  contribution  from  the  second  graph  can  be  calcu- 
lated. The  matrix  element  in  this  case  will  in  effect  replace  A^^  (q)  by  Af^{q) 
where 


<(9)  =  -^  "i/*^'*^     ^P"''     t''/''^^'*^)  y-'^f  (^  +  «)  ^'(?)]- 


(49) 
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The  charge  renormalization  is  carried  through  as  before  and  the  level  shift 
calculated.  The  total  contribution  from  the  two  graphs  is  a  very  good  approxi- 
mation to  the  observed  value.  Note  that  the  main  shift  comes  from  a  non-relativi- 
stic  contribution,  applying  only  to  the  2  aS  state  and  is  of  the  order  ofoc  (Zoc)^  log  Zoc 
where  Z  is  the  nuclear  charge.  The  corresponding  relativistic  contribution  is 
considerably  smaller  being  of  the  order  of  oc  (Zoc)^. 

The  removal  of  divergences  from  a  general  graph  ^ 

We  shall  now  consider  the  methods  of  removing  the  divergences  from  a  general 
71-th.  order  graph.  There  may  appear  in  the  /S'-matrix  three  distinct  types  of  in- 
finities which  may  be  classified  as  follows : 

(i)  The  ultra-violet  divergences  which  arise  because  of  the  integrand  not 
vanishing  rapidly  enough  at  large  momenta ; 

(ii)  Singularities  caused  by  the  coincidence  of  two  or  more  poles  of  the  integrand ; 

(iii)  The  infra-red  divergences  caused  by  the  presence  of  a  factor  —^  in  the  inte- 
grand. This  is  only  a  spurious  divergence  since  it  vanishes  when  all  the  diagrams 
contributing  to  a  particular  process  to  the  given  order  are  considered^. 

The  renormalization  procedure  is  devised  to  handle  only  the  first  type  of 
divergence.  Any  divergence  of  this  type  is  removed  if  a  sufficient  number  of 
integration  variables  (i.e.  internal  momenta)  are  kept  fixed.  This  is  equivalent  to 
cutting  the  corresponding  internal  lines  thereby  making  them  equivalent  to  a 
pair  of  external  lines.  A  divergent  graph  is  called  "primitive"  if  whenever  one  of 
the  4-momentum  vectors  in  its  integrand  is  held  fixed,  the  integration  over  the 
remaining  variables  is  convergent. 

We  now  define  the  "dimension"  D  of  an  integral  of  a  function  with  respect 
to  a  variable  as  equal  to  the  highest  power  of  the  variable  in  the  numerator 
minus  the  highest  power  of  the  variable  in  the  denominator,  each  differential 
inside  the  integral  contributing  unity  to  the  power  of  the  numerator.  It  is  clear 
then  that  D  should  be  less  than  zero  for  the  integral  to  be  convergent.  To  cal- 
culate D  we  note  first  that  integration  over  each  internal  electron  line  is  of  the 

form  / — : d*^  and  contributes  three  powers  to  D.  Similarly  each  internal 

photon  line  contributes  two  powers  to  D.  There  are  altogether  n  ^-functions, 
but  one  of  these  merely  expresses  the  overall  momentum  conservation  while 
each  of  the  others  contributes  a  power  — 4  to  D.  Integration  with  respect  to  a 
(^-function  just  amounts  to  stating  that  the  integral  is  equal  to  the  integrand 


1  F.  J.  Dyson,  Phys.  Rev.  75,  486,  1736  (1949). 

2  J.  M.  Jauch,  and  F.  Rohrlich,  Helv.  Phys.  Acta  27,  613  (1954);  F.  Rohrlich,  Phys. 
Rev.  98,  181  (1955);  F.  Bloch  and  A.  Nordsieck,  Phys.  Rev.  52,  54  (1937);  W.  Braunbeck 
and  E.  Weinmann,  Z.  Phys.  110,  360  (1938);  W.  Pauli  and  M.  Fierz,  Nuovo  Cim.  15,  167 
(1938);  R.  J.  Glauber,  Phys.  Rev.  84,  395  (1951);  W.  TmRRiNG  and  B.  Touchek,  Phil.  Mag. 
42,  244  (1951).  See  also  G.  Morpurgo,  Nuovo  Cim.  Suppl.,  109  (1951). 
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with  parametric  values  corresponding  to  the  singularity  of  the  ^-function.  Since 
we  have  generally  assumed  that  integration  contributes  +  4  to  D  we  have  to  assume 
that  the  ^-function  contributes  — 4.  But  the  (5-function  expressing  the  over  all 
energy  momentum  conservation  is  not  integrated  and  therefore  it  does  not  con- 
tribute to  the  power.  Thus  we  have 

D=3Fi+2Bi-4:{n-l)  (50) 

where  7i  is  the  number  of  vertices,  F^  the  number  of  internal  fermion  lines  and  B^ 
the  number  of  internal  boson  lines.  To  express  D  in  terms  of  the  number  of 
external  lines,  we  note  the  two  relations  we  have  already  derived,  viz. 

2Fi+F,^2n,        '  (51) 

2Bi+B,=-n.  (52) 

Therefore 

i>^4-3/2i;-5,.  (53) 

It  follows  that  the  graph  will  be  convergent  if 

D<0    or    3/2i^, +  5,  >4.  (54) 

Thus  the  important  fact  emerges  that  the  degree  of  divergence  is  independent 
of  the  order  of  the  diagram. 

We  wish  now  to  give  a  consistent  prescription  for  the  removal  of  the  infinities 
from  the  graphs  of  any  order  in  the  AS-matrix.  For  this  it  is  convenient  to  use  the 
following  terminology.  A  graph  is  called  improper  if  it  can  be  separated  into  two 
distinct  graphs  by  cutting  an  electron  or  a  photon  line.  A  proper  graph  is  one 
which  is  not  improper.  Thus  an  improper  graph  can  be  split  into  proper  graphs 
and  renormalization  carried  out  separately  for  each  such  part.  We  will  therefore 
be  concerned  only  with  proper  graphs.  The  part  of  a  diagram  which  is  connected 
to  the  rest  by  two  photon  or  electron  lines  is  called  the  self  energy  part.  The  part 
which  is  connected  to  the  rest  of  the  diagram  by  a  pair  of  electron  lines  and  a 
photon  line  is  the  vertex  part.  The  graph  obtained  from  any  graph  G  by  replacing 
all  the  electron  self  energy  parts  by  electron  lines,  the  photon  self  energy  parts  by 
photon  lines  and  the  vertex  parts  by  simple  vertices  (two  electron  lines  and  a 
photon  line)  i.e.  by  removing  the  insertions  on  the  lines  and  at  various  corners 
is  called  the  skeleton  of  (7.  A  graph  which  is  its  own  skeleton  is  called  "irreducible". 
Thus  a  "reducible"  graph  of  any  order  can  be  built  up  by  inserting  in  the  hnes  and 
vertices  of  its  skeleton  self  energy  and  vertex  parts  of  requisite  orders  in  all  possible 
ways.  All  such  reducible  graphs  may  be  divided  into  three  classes: 

(i)  graphs  in  which  no  two  insertions  overlap, 

(ii)  graphs  in  which  each  insertion  is  completely  contained  within  the  next, 

(iii)  graphs  in  which  insertions  overlap. 
For  the  first  type  of  diagram,  renormalizations  can  be  carried  out  for  each  separate 
insertion  by  the  methods  already  outlined.  For  the  second  type  we  may  succes- 
sively remove  the  divergences  from  each  insertion  starting  with  the  innermost 
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and  ending  with  the  divergence  arising  from  G  itself.  For  the  third  type  however, 
the  method  is  more  compUcated.  Consider  for  instance  in  Tig.  19  the  fourth  order 
electron  self-energy  part : 


P  p-ki  p-k-kj  p-k2  0 

Fig.  19.  Fourth  order  self  energy  diagram  with  overlapping  photon  lines. 

The  corresponding  matrix  element  is 

'^^^  1  p-ki  +  m 


i^//d^.,d^.,-,. 


p-k,-ka  +  m  p-k^  +  m  1 

ip-h- hf - ^'  '^  ip - hf  - ^'     ki  •  ^  ^ 

Keeping  either  k^  or  Aig  fixed  and  integrating  over  the  other  leads  to  a  logarithmic 
divergence,  while  the  double  integral  diverges  linearly.  The  procedure  to  remove 
the  divergence  here  has  been  given  by  Salam^  and  is  as  follows. 

Consider  for  simplicity  an  electron  self-energy  graph  with  three  integration 
variables  t^ ,  t^  and  ^3 .  First  keep  ^2  and  ^3  fixed  and  substract  the  divergence  for 
integration  over  t^.  This  divergent  term  is  obtained  by  setting  in  those  factors 
in  which  t^  occurs,  the  other  integration  variables  ^2  and  t^  equal  to  zero  and  the 
external  momenta  equal  to  their  corresponding  free  particle  values,  the  factors 
not  involving  ^^  being  left  unchanged.  This  is  done  in  turn  for  integrations  over 
^2  and  ^3  also.  The  next  step  is  to  substract  the  divergences  for  integrations  over 
the  pairs  of  variables  titj  (i,  j=  1,2,3)  from  the  integrand  obtained  after 
subtracting  the  divergences  for  t^,  t^  and  ^3  the  method  being  similar  to  that 
for  t^.  We  finally  repeat  the  procedure  for  integration  over  t^t^t.^,  the  divergent 
terms  being  separated  out  using  Dyson's  prescription.  The  resultant  integral 
after  all  these  subtractions  is  convergent  for  any  of  the  integrations  t^,  t^tj  or 
tyt^t^  and  is  the  finite  correction  to  the  electron  propagator  which  we  set  out 
to  evaluate.  The  procedure  outlined  here  has  also  been  generalized  to  any  arbitrary 
diagram  involving  n  variables  by  Salam. 


1  A.  Salam,  Phys.  Rev.  82,  217  (1951). 


CHAPTER  III 

FORMALISM  OF  QUANTUM  FIELD  THEORY 

1.  THE  STATE  VECTOR  AND  FIELD  OPERATORS 

Introductory  remarks 

We  have  till  now  been  confining  our  attention  to  the  equations  of  single  particles 
and  perturbation  theories  of  the  wave  functions  relating  to  these  particles.  The 
systems  containing  many  particles  were  described  in  terms  of  single  particle 
wave  functions  when  the  total  number  of  particles  was  assumed  to  be  constant. 
Such  a  description  cannot  include  the  concept  of  creation  and  annihilation  of 
particles  unless  such  creation  and  annihilation  are  treated  as  scattering  in  the 
Feynman  formalism. 

A  systematic  attempt  to  deal  with  a  multiplicity  of  particles  is  the  more 
sophisticated  field  theory i.  The  aim  of  field  theory  is  to  enlarge  the  scope  of 
quantum  mechanics  beyond  the  treatment  of  single  particle  wave  equations. 
We  know  that  the  interaction  between  two  fundamental  particles  like  electrons 
can  be  understood  through  the  exchange  of  photons,  both  real  and  virtual. 
One  of  the  effects  of  interaction  is,  for  example,  a  change  in  the  momentum  of 
the  interacting  particles.  If  the  momentum  of  a  particle  changes  from  'p  to  j)'  the 
process  may  be  viewed  as  the  destruction  of  a  particle  of  momentum  'p  and  the  crea- 
tion of  a  particle  of  momentum  p' .  Viewed  in  this  manner  a  system  of  fundamental 
particles  may  be  described  by  a  state  vector  involving  the  occupation  numbers 
of  particles  in  different  states  characterized  by  the  values  of  dynamical  variables. 
If  there  is  no  interaction  between  these  particles,  the  occupation  numbers  do 
not  change.  The  effect  of  the  interaction  is  to  make  these  occupation  numbers 
vary  with  time.  Our  object  will  now  be  to  study  the  main  features  and  properties 
of  these  state  vectors  and  thereby  prove  the  necessity  for  the  introduction  of 
field  operators  which  operating  on  these  state  vectors  yield  the  occupation 
numbers  and  their  mode  of  variation.  The  essential  peculiarity  of  field  theory 
can  be  summarized  in  a  naive  manner  as  foUows.  While  in  the  single  particle 
formaHsm  we  deal  with  the  wave  function  ip,  in  field  theory  we  have  to  deal 


^  Excellent  introductions  to  Quantum  Field  Theory  are  given  in  G.  Wentzel,  The 
Quantum  Theory  of  Fields,  Interscience  (1946);  S.  S.  Schweber,  H.  A.  Bethe  and  F.  de 
Hoffmann,  loc.  cit.,  J.  M.  Jauch  and  F.  Rohrlich,  The  Theory  of  Photons  and  Electrons, 
Addison-Wesley  (1955);  N.  N.  Bogoliubov  and  D.  V.  Shirkov,  Introduction  to  the  Theory 
-of  Quantized  Fields,  Interscience  (1959). 
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with  two  types  of  quantities,  tp  the  field  operator^  and  ^  the  state  vector.  ^F  bears 
a  correspondence  to  wave  function  yj  in  that  both  are  state  vectors  while  the  field 
operators  also  bear  correspondence  to  the  wave  function  since  they  have 
"equations  of  motion"  similar  to  single  particle  wave  equations.  That  is  why  a 
difficulty  is  felt  as  to  the  best  manner  of  introducing  the  two  fundamental  quan- 
tities, the  field  operators  y)  and  the  state  vector  W,  as  an  extension  of  the  single 
particle  formahsm.  Fermi ^  for  example  introduces  the  field  operators  in  direct 
analogy  with  ip  while  Bethe  and  Schweber  describe  W  in  analogy  with  yj.  The 
deductive  approach  will  be  to  develop  the  state  vector  W  in  analogy  with  the 
single  particle  wave  function  and  deduce  the  operators  which  satisfy  suitable 
commutation  relations.  In  the  absence  of  interactions,  the  number  of  particles 
present  in  a  field  is  a  constant  of  motion  and  this  situation  can  be  described  by 
the  Schrodinger  equation^ 


^n''  '') 


.  30 


Xnl  t) 


(1) 


H{x^) 


2m 


n-\-y{xic) 


(2) 


where  0  is  the  wave  function  (in  configuration  space)  of  an  n  particle  system 
and  {apyfc}  the  spatial  coordinates  of  the  aggregate  and  //  the  Hamiltonian  of  the 
system  which  is  a  function  of  the  aggregate  {x^  and  operators  associated  with  it. 
For  example  if  the  particles  are  moving  under  a  common  potential  F,  H  can  be 
written  as 


2m 


f^l  +  V{x,) 


(3) 


(4) 


W  can  be  expanded  in  terms  of  properly  symmetrized  functions  0  as 

'I'  =  Ucn{[ni],f)0l,^ 

where  0^ni}  is  the  configuration  representation  of  a  state  corresponding  to  an  aggre- 
gate of  particles  {?i^j  with  n^  particles  in  the  ^-th  state  (*  =^  1,  2,  .  .  .)  the  total 
number  of  particles  being  n.  c  represents  the  amplitude  for  the  members  of  the 
aggregate  {w^}  being  in  their  respective  states.  The  symbol^  denotes  integration 
or  summation  according  as  the  states  form  a  continuous  or  a  discrete  set. 

The  Schrodinger  equation  satisfied  by  the  wave  function  0  can  only  be  used 
when  the  total  number  of  particles  is  fixed  since  the  Hamiltonian  which  is  a 
function  of  the  coordinates  of  the  particles  can  be  described  only  if  the  total 
number  is  specified.  However  W  as  described  by  (4)  by  very  definition  need  net 


^  It  is  now  an  accepted  notation  to  use  the  same  symbol  for  the  wave  function  of  a  single 
particle  and  the  field  operator. 

2  E.  Fermi,  Elementary  Particles,  Yale  University  Press  (1952). 

3  S.  S.  Schweber,  H.  A.  Bethe  and  F.  de  Hoffman,  Mesons  and  Fields,  Vol.  I,  pp.  145, 
Row  Peterson  and  Co.  (1956). 
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be  confined  to  the  case  of  a  fixed  number  of  particles  since  n  can  vary  in  0„. 
In  such  a  case  if  we  wish  to  write  a  Schrodinger  equation  in  ^^  similar  to  (1), 
then  B.  cannot  be  just  described  through  a  function  of  a  fixed  number  of  variables 
but  in  terms  of  what  are  known  as  field  operators.  We  sh^ll  take  up  the  dis- 
cussion of  the  expansion  in  a  rather  pedagogic  manner  but  in  a  way  which  leads 
us  most  naturally  to  the  operator  calculus  of  field  theory.  We  first  realize  that  the 
states  that  are  usually  referred  to  are  definite  momentum  states  of  an  assembly 
of  non-interacting  particles.  The  ^^«.}  is  the  configuration  representation  of  a 
stationary  state  corresponding  to  the  specified  momenta  of  a  fixed  number  n  of 
free  non -interacting  particles,  stationary  in  the  sense  that  the  only  time  depend- 
ence is  of  the  form  e"*^*  where  E  is  the  total  energy  of  the  non-interacting  system. 
Any  other  time  dependence  of  ^  is  imbedded  in  c.  The  state  vector  W,  assuming 
interaction,  is  the  linear  combination  of  such  0's.  c  in  such  a  case  can  be  more 
explicitly  written  as  c(72^^j ,  ^2^2  •  •  •)'  which  is  therefore  the  many  particle 
analogue  of  the  decomposition  of  a  single  particle  wave  function  into  free  particle 
wave  functions.  Taking  the  single  particle  case 

^(a;)=zz|c(/))e-^^^d3^  (5) 

Q-%vx  jg  ^jjg  configuration  representation  of  a  single  particle  of  momentum />, 
i.e.  it  is  a  degenerate  or  trivial  form  of  the  0  function,  c(^)  the  Fourier  transform 
of  ^  {x)  represents  the  amplitude  that  a  single  particle  has  momentum  />  or  what 
is  the  same  it  is  the  amplitude  in  the  "occupation  number  space".  Therefore  it 
is  the  degenerate  or  trivial  form  of  the  general  c-function  with  n^=^  1,  'p^  ^--  j) 
and  the  rest  of  the  n^  =^  0.  Thus  we  realise  that  the  amplitude  in  the  occupation 
number  space  is  nothing  but  the  momentum  representation  of  W.  Since  the 
number  of  particles  is  greater  than  one,  we  have  assumed  the  possibility  of  more 
than  one  particle  in  a  single  state.  It  is  to  be  noted  that  in  the  many  particle 
case,  W  in  configuration  space  is  the  linear  combination  of  the  wave  functions 
0[ni}^  each  wave  function  for  a  particular  number  of  particles  having  as  its 
arguments  the  corresponding  number  of  space  variables.  How  then  are  we  to 
make  the  transformation  into  the  momentum  space  for  such  a  W^. 

The  peculiarity  of  the  situation  can  be  very  easily  illustrated  even  in  the 
elementary  theory  of  functions.  If  fi(x)  is  a  function  of  a  single  variable  x,  its 
Fourier  transform  is  defined  as 

<fAp)=lfi{x)e*''^dx.  (6) 

If  /g  {x,  y)  is  a  function  of  two  variables  x  and  y  its  Fourier  transform  is  given  by 

9'2(P.?)=//a(^,y)e'^"*"dxd2/.  (7) 

Then  what  is  the  Fourier  representation  of 

F  =  Ui?:)  +  h(x,y).  (8) 

Two  possibilities  arise.  If  we  treat  #  as  a  function  of  both  the  variables  x  and  y 
we  have  to  write  the  double  transformation  as  (p-^if)  d(q)  +  q)2,{'P,  ?)•  The  intro- 
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ductiou  of  the  ^-function  when  the  number  of  variables  is  very  large  becomes 
very  clumsy.  To  avoid  this  we  can  take  q)^  {p)  +  cp^  {p,  q)  as  the  transform  of  F 
with  the  specific  understanding  that  to  obtain  the  inverse  of  F  in  space  represen- 
tation, we  must  take  the  corresponding  inverses  of  (p^^  and  (p^  separately  and 
add  them.  Adopting  this  convention,  (pi{p)  +  (pzip,  q)  is  the  Fourier  representation 
of  F.  This  is  exactly  the  procedure  we  adopt  in  the  theory  of  wave  functions 
and  therefore  identify  ^  c  or  equivalently  c  as  the  momentum  representation  of  W. 
The  c's  are  referred  to  as  state  amplitudes  and  W  as  the  state  vector.  This 
completes  the  analogy  with  the  single  particle  wave  function  (see  chapter  I). 
In  any  quantum  mechanical  problem  we  shall  be  interested  in  the  rate  of 
variation  of  the  state  vector  W.  In  the  one-particle  formaUsm,  this  is  given  by  the 
Schrodinger  equation  and  we  would  like  ¥^  to  obey  a  similar  equation.  However  in 
the  many  particle  case  ^  is  a  linear  combination  of  c's  and  99's  and  dW/dt  cannot 
be  expressed  only  in  terms  of  W.   But  we  could  still  insist  upon  the  form 

^      ^  HW  provided  the  operation  ofHonW  represents  the  required  linear 

combination,  i.e.  H  contains  operators  which  connect  the  ^J,.}S.  These  are  just 
the  annihilation  and  creation  operators  and  the  entire  formalism  of  field  theory 
is  built  round  them  to  such  an  extent  that  it  has  become  conventional  to  start 
with  these  operator  and  the  commutation  relations  associated  with  them  as 
postulates  on  which  the  quantum  field  theory  is  based.  (It  is  noteworthy  that 
workers  in  the  field  do  not  seem  to  adopt  a  uniform  sequence  of  ideas  and  postu- 
lates though  every  one  is  agreed  upon  the  methods  of  calculation  and  convinced 
of  the  equivalence  of  the  different  approaches.)  Even  in  regard  to  the  definition 
of  these  operators,  no  uniform  procedure  is  followed.  For  example,  the  annihila- 
tion and  creation  operators  a^  and  ajare  defined  as  follows: 

On  the  other  hand,  they  are  also  defined  with  respect  to  c  as : 

a\c,,{n^,  n^,--',  Ui,  •  •  •) -^  j/w^  +  lc„  +  i(Wi,  n^,  •  -  • ,  Ui-^l,  •  •  •) .  (9a) 

«iC«(^i,  n^,  ..  .,  rii,  .  .  .)->|/^|C„-i(7ii,  n^,  ..  .,n^-l,  .  .  .).  (9  b) 

To  understand  the  equivalence  of  the  two  definitions  let  us  ask  the  more  com- 
prehensive question :  what  is  the  effect  of  operating  the  creation  and  annihilation 
operators  on  any  W,  where  W  represents  the  linear  combination  of  0«'s,  the 
coefficients  c  being  the  amplitudes  for  the  various  occupation  states? 

Now  alW  naturally  differs  from  W  and  should  imply  the  additional  information 
that  there  is  one  more  particle  with  momentum  k.  To  put  it  in  a  more  mathematical 
form  we  suitably  apply  the  above  definitions  (9)  and  (9  a,  b).  To  obtain  a|^if 
we  just  replace  every  0'^  by  )/%  -j-  1  0*'+^  in  ¥^  the  coefficient  c,,  will  be  attached 
to  0*^  +  ^  which  is  an  inconsistent  notation  though  it  is  meaningful  to  make  the 
8* 


(9) 
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statement  that  the  coefficient  of  0'^  +  ^  in  the  expansion  of  al0  is  Vu/.  H-  lc„  . 
To  get  over  this  difficulty  we  write 

n  n 

where 

d  being  the  Kronecker '  delta  function  of  the  discrete  parameter  k.  We  realize 
that  a^'s  operate  both  an  0„  and  c  and  change  0"  to  O^^  ^  ^  and  c^  to  c'^  +  ^ .  If  we 
take  in  particular 

*'=<^»,),  (U) 

i.e.  all  the  coefficients  c  are  zero  except  c^^^j  corresponding  to  ^^«.\  which  is  unity, 
then  operating  a^  on  W 

al'P  =  al0'>  =  c'„,,{k,---k,„k)0l^f=yn,+  l0"-\       '  (12) 

Hence  in  this  case  it  is  enough  if  we  define  the  operation  of  a  on  0's . 

From  the  definition  of  annihilation  and  creation  operators  through  (9  a)  and 
(9  b)  it  follows  that 


(13) 


which  are  the  commutation  relations  for  bosons.  If  we  define  the  operators  not 
through  (9),  (9  a)  and  (9  b)  but  by  the  equation 


alcn  (%,  n2,  .  .  .,  7^,:,  .  .  .)  =  )/(l  -  71^)  c,,^^  [n^,  n^,  .  .  .,  (1  -  n^), 
«iC,,  (^1,  n^,  .  ..,n,,  .  ..)  =  ]/ni-c„_i  {n^,  n^,  .  .  .,  (1  -  w,),  ..  .) 

we  get  the  anticommutation  relations  for  f ermions : 


(14) 


(15) 


Thus  we  have  introduced  the  concept  of  the  state  vector  in  configuration  | 
and  momentum  spaces  when  we  have  a  changing  number  of  particles  and  thereby 
the  annihilation  and  creation  operators.  We  have  yet  to  arrive  at  the  form  of 
the  Hamiltonian  and  the  definition  of  field  operators.  Before  doing  this  it  will 
be  convenient  to  develop  perturbation  expansions  for  the  Schrodinger  equation, 
and  by  putting  these  expansions  into  correspondence  with  the  wave  function 
formalism  we  arrive  at  the  definition  of  field  operators.  In  adoptirig  this  procedure 
we  have  completely  reversed  the  order  of  the  conventional  presentation  of  ideas  mainly 
to  emphasize  how  a  field-theoretic  description  is  forced  upon  us  in  the  collision  theory 
of  mxiny  particle  systems. 


^  Here  the  arguments  in  the  brackets  refer  to  the  individual  momenta  of  the  particles; 
grouping  the  momenta  yields  the  occupation  numbers  of  (9  a). 
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We  shall  show  in  the  succeeding  chapter  that  in  writing  the  generalized  Schro- 
dinger  equation  the  state  vector  should  be  in  momentum  representation,  i.e. 
described  hy  c(t)  and  hence  we  deliberately  use  c{t)  instead  ofW(t)  as  is  customary. 
In  dealing  with  a  single  particle  system  c{t)  becomes  identical  with  b{t)e~^^^ 
used  in  chapter  II. 

Perturbation  expansion  in  field  theory 

We  are  concerned  with  the  solution  of  the  generalized  Schrodinger  equation 

il^^H(t)c{t)  (16) 

representing  the  variation  of  the  state  vector  c{t)  with  time  t  and  H{t)  is  the 
Hamiltonian  operator  the  form  and  structure  of  which  depends  upon  the  "re- 
presentation of  c".  The  Schrodinger  equation  expresses  a  relationship  between 
states  separated  by  infinitesimal  intervals  of  t,  i.e.  between  t  and  t  +  A,  A  being 
a  positive  or  negative  infinitesimal  quantity.  By  solution  we  mean  the  establish- 
ment of  a  relationship  between  0(^2)  *^^  ^{h)y  h  —  h  being  finite.  It  will  be  con- 
venient for  us  to  express  such  a  relation  in  terms  of  sums  and  integrals.  The 
integral  expansion  can  then  he  exte7ided  to  include  descriptions  of  state,  the  temporal 
variation  of  which  cannot  be  expressed  in  terms  of  a  differential  equation  with  respect 
to  t.  In  such  cases,  we  take  the  integral  relationship  to  be  fundamental  and  in 
any  quantum  mechanical  process  we  compute  c{t^  given  c(t^  under  particular 
cotiditions  and  limiting  processes.  It  is  customary  to  decompose  H  as 

H(t)^H,+  H'{l),  (17) 

where  Hq  is  independent  of  time  and  H*  (t)  is  the  part  which  is  treated  as  a  per- 
turbation and  may  in  general  be  time-dependent.  The  Schrodinger  equation 
is  written  in  the  form 

c(t  +  A)  ^  {1  +  {-i)[H,  +  ir{t)]A}  c{t)  +  0(Z|2),  (18) 

i.e.  given  c{t),  c(t -\- A)  is  obtained  by  operating  the  infinitesimal  operator 
|l  +  {—i)[HQ  +  H'{t)]A}  on  c(t)  and  therefore  to  obtain  the  solution  0(^2)  i^ 
terms  of  c{tj)  for  t^  >  t^  we  iterate  this  operation  which  yields  the  integral 
expansion 

where 

U.it^,  t^)  -  e-*^^«^^^-^^>  +  i—i)  /  e-^^o(«.--)  H'  (t)  e-*^o(--<x)  dr 


c(t2)-UAk,ti)c{t,)  (19) 


+  (-i)2|dT2/e-*'^o(«.-^.)^'(T2)e-*"^o(-.--,)^'(T^)e-*«.^idTi. 

+  ••..    '^         ''  (20) 

We  can  also  write 

UAt2,fi)  =  ^-'''^''-U,{t„t,)e^''^\  (21) 
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whcrG  ^  ^*       ^^ 

Ujih,  h)  =  l+  {-i)JH'i[T)  dr  +  (-^)2/dTl/dTl 

^;(t2)^;(ti)  +  ....  (22) 

Defining  h  it)  through 

^    ''  ^  c{t)  =  e-''otb{t)  (23) 

we  note  that 

^ih)-  Uj{t„t,)b{t,).  (24) 

The  state  vector  can  be  represented  either  by  b{t)  or  c{t).  b(t)  is  called  the  inter- 
action representation  and  c{t)  the  Schrodinger  representation  of  the  state,  both 
being  described  in  the  momentum  space. 

Without  splitting  H  {t)  we  can  write  Ug  {t^ ,  ^i)  as 

Usik,  y  =  1  +  i-i)/  H{r)dT+{-ir  fdr^fdT^Hir^)  H{r^) 

+  •  •  •  +  {-if  jdr,,jdTn-,  •  •  •  /dTi H{rn)  •  •  •  ^(t^)  +  •  •  • .     (25) 

The  most  important  feature  of  this  expansion  is  that  the  time  points  t„  ,  .  .  . ,  r^ 
in  the  integrand  of  n-th.  order,  when  ^2  >  ^i  ?  ^^^  ordered  such  that 

f  2  ^  T^n  ^  T^n  - 1  *  *  '  ^  ^i  ^  ^i  • 

The  infinitesimal  form  of  the  equation  is  also  valid  if  zJ  is  negative,  or  equivalently 

we  can  write 

c{t  —  A)  =  [l-'(-i)H(t)A]c{t)  (26) 

A  being  positive.  This  immediately  implies  that  we  can  equally  well  obtain 
c(t^  in  terms  of  c(^i)  even  if  t^Kt^.  We  obtain  u  {t^,  t^)  for  t^  <  t^  as 

U,{t„t,)  =  l  +  {-i){-l)fH{T)dT-i-{-in-irflH{T^)H{r,)dT,dT, 

ti  tz  ti 

+  •••  +  (-  Ifi-if  jf'  ■  'fH{Tn)  •  •  •  H{T,)  dTi- . .  dr^  +  •  •  •        (27) 
'2  '2        '2 

where  t^  <C  r^  <^  r^^i  •  •  •  <C  t^.  It  is  to  be  noted  that  since  we  chose  A  to  be 
positive,  the  integration  is  from  the  lower  limit  ^g  to  the  upper  limit  t^^.  Instead 
of  having  ( — 1)^  outside  the  integral,  we  can  ascribe  a  factor  ( — 1)  to  each  ^(t). 
Thus  we  see  that  we  have  to  use  different  forms  of  integral  expansions  for  ^2  >  h 
and  for  ^2  <  ^1  •  On  the  other  hand  if  we  wish  to  retain  the  same  form,  i.e.  have  the 
integration  performed  from  t^  to  t^  whether  t^  ^  t^  we  can  drop  the  factor  ( — 1)'* 
and  order  the  variables  r^  .  .  .r^  in  the  direction  t^  to  t^, .  This  mode  of  ordering 
implies  that  if 

^2  >  ^1 '  ^1  <  T^l  <  "f 2  •  •  •  <  '^n  <  ^2 

and  if 

^2  '^  t^  ,  t2  <^  T^  <i  Tff  _i  .  .  .  <C  Tj  <C  Cj  . 
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From  this  point  of  view  it  is  evident  that 

Us{t2,ti)=  U,{t^,t^)u  {t„t,)  (28) 

Us(h,h)=  1=  UAh,to)Us{to,h)  (29) 

^""^  UsihJo)-  U-^(t„t,),  (30) 

We  note  that  such  an  ordering  of  the  c  variables  is  an  essential  consequence 
of  the  differential  equation,  or  put  more  emphatically  if  c  satisfies  a  differential 
equation,  the  time  variables  in  Ug{t2,  t^)  are  to  be  ordered  from  t^  to  t^.  Since  we  shall 
be  requiring  integration  of  the  variables  from  t^  to  t^o,^  t^-^  —  oo  and  ^2  -^  ^"  ^ 
the  ordering  demands 

00  >  T,j  >  T„  _  1  .  .  .  >  T^  >  —  00  . 

Such  an  ordering  is  called  the  P  ordering.  We  can  remove  the  restriction  on  the 
ordering  provided  we  can  introduce  an  operator  which  operating  on  the  product 
H  (r^)  .  .  .  H  (ti)  orders  the  terms  in  a  manner  such  that  the  t  parameter  decreases 
from  left  to  right.  Ugit^,  t^)  for  ^2  >  ^1  can  then  be  A\Titten  as 

/.    r 

Us{t2,  h)  =  e- '  ^o(f.-^^^  +  {-i)l  P\  e-^ ^o(«.-t)  H'  (t)  e-^"^(^-^>  drj 


(31) 


In  any  collision  process  we  are  interested  in  the  matrix  element  bfi{t2,  t^) 
of  transition  from  a  given  initial  state  i  at  tj^  to  a  definite  final  state  /  at  t^  as 

as  ^2  ->  +  00  and  i^  ^  —  00 .  The  general  final  state  b  {t^)  is 

h{t^)'-2^idh,h)h{h)'  (32) 

/ 
It  is  customary  to  write  bfi  as 

bH  =  bfi{k,k)  (33) 

Ltt2-^  +00 

tl-*-  —  00 . 

bf  ^is  the  projection  of  the  state  b{t)  at  t^  on  the  required  final  state  /.  The  method 
of  obtaining  the  matrix  element  bfi  depends  on  the  way  in  which  the  integral 
expansion  is  computed. 

We  shall  first  interpret  the  term  e-*^«^i  b{t{)  with  bj{t^)  =  0  for  all  j  ^  i  and 
bi{tj)  =  1  and  thus  independent  of  t.  0^^^)  is  the  momentum  representation  of  the 
initial  system  of  particles  which  can  be  written  as 

Ci{t,)  =  bMe-'^ot.  =  e-'^ot.,  (34) 

where  Eq  is  the  total  energy  of  the  initial  system.  If  the  initial  state  consists  of 
a  system  of  free  particles,  b  (t^),  as  ^i->— 00 ,  can  be  written  as  a  sequence  of  creation 
operators  independent  of  time  operating  on  a  vacuum,  i.e. 
=  (a  sequence  of  creation  operators)  >o. 
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We  shall  call  such  a  state  |i>.  A  similar  interpretation  is  given  to  the  final  state 
</|.  As  before  it  can  be  expressed  as  a  sequence  of  destruction  operators  in- 
dependent of  time  operating  on  a  vacuum, 

i.e.  </|  =  <(a  sequence  of  destruction  operators). 
Since  the  matrix  element  bf^  can  now  be  written  as 

=-<f\Uj{t^,t,)\iy 

=  <f\s\iy  (35) 

where  S=        Lt  Uj{t^,u;)  (36) 

and  is  knoAvn  as  the  /S-matrix  introduced  by  Heisenberg.  For  convenience  we 

oo 

write  S  =  2]  '^^"'^  where  S^^Hs  the  n-th  order  matrix  element.  Now  we  have  reached 

M  =  0 

a  stage  when  we  can  identify  Hj  in  terms  of  annihilation  and  creation  operators 
and  wave  functions  of  single  particles.  We  shall  do  so  for  the  system  of  fermions 
and  bosons  and  their  interactions.  For  this  purpose  we  shall  take  only  the  first 
order  matrix  element,  i.e. 

</l^<l>|^>^(-^•)</|/^}(T)dT|^>.  (37) 

For  simplicity  we  shall  consider  electron  scattering  in  the  non-relativistic  case, 
i.e.  when  we  postulate  that  the  initial  state  consists  of  one  electron  and  the  inter- 
action is  such  that  there  is  no  annihilation  or  creation  of  positrons.  We  shall 
also  assume  that  the  photon  field  is  classical  and  is  represented  for  simplicity  by 
(ignoring  normalization  and  polarization  factors) 

It  is  clear  that  if  we  choose 

Hi{t)=  [[U  alip,{x)][2]  a,^yjk'{x)]e-'^'-  d^x,  (38) 

J      Jc  k' 

substitution  of  this  and  |«/>o  for  |«">  and  |at>Q  for  |/>,  with  the  use  of  the  result 

alai,.y  -  0     if     k^k' 
-  1    for    k==  k', 

yields  exactly  the  perturbation  expansions  of  chapter  II.  Hj{t)  can  therefore 
be  written  as 

H'iT)  =  lip{x)y){x)e-'^-''d^x,  (39) 

where 

y){x)^  JJajc'yJk'ix)  (40) 

k' 

ip{x)  =  2J4nix)  (41) 

k 
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if{x)  is  then  called  the  field  variable  associated  with  the  fermion.  The  summation 
is  over  all  k  but  only  positive  energy. 

However  this  treatment  is  incomplete  since  we  have  not  taken  into  account 
the  negative  energy  states  in  the  definition  ofy){x)'^.  More  generally  we  can  define 


xp{x)  = 

Zh{k)xpu{x)  +  2Jb{-k)y^_^{x) 

.  k                                      k 

(42) 

ip{x)^ 

y]h'{h)ipk(x)-\-  Zb'{-h)ip_^,{x) 

.  k                                     k 

(43) 

where  +  denotes  summation  over  positive  energies  and  —  the  summation 
over  negative  energies.  Since  there  cannot  be  negative  energy  particles,  we  have 
to  iQterpret  the  annihilation  operator  b( — k)  of  a  particle  of  negative  energy 
as  the  creation  operator  of  an  antiparticle  with  positive  energy;  and  creation 
operator  b'^{ — k)  of  a  particle  with  negative  energy  as  the  destruction  of  an 
anti-particle  with  positive  energy 


and 


6-R(-fc)  =  <&(fc) 

Thus  we  define  a  field  operator  associated  with  a  fermion  field  as 


(44) 
(45) 


W{^)-i:U  KW  u'{k)  e- ^^-^  +  dl[k)  vjk)  e-^*-^ 

k  r=l 


Wi^)  =  S  Z  Ki^)  u'lk)  e^^-^  +  d,{k)  v'{k)  e-^'^-^ 

k   r=l 


(46) 
(47) 


or  more  precisely  in  the  limit  of  continuous  p  values  including  normalization 
factors  and  summation  over  spin  states 


and 


W(^)  =  -^2W^j(~^f\2^^^^  (48) 

ip{x)  =  -(2^/(^f '  ZJKW  u^{k)e'''''-^  d^k)  v^{k)  e-^-jd^A;       (49) 


the  normahzation  being  uu  =  1  and  vv  =  — 1. 

Therefore  the  bilinear  interaction  term  represents  the  following  four  processes 
envisaged  in  the  Feynman  theory : 


1  Till  now  we  were  using  a^{k)  and  a{k)  for  the  creation  and  annihilation  operators  as  the 
discussion  was  general.  As  we  are  now  going  to  consider  the  case  of  the  electron  we  shall  use 
b{k),  b^{k)  and  d{k),  d^{k)  for  the  particle  and  anti  particle  operators  respectively. 
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Photon  emission  with 

(1)  6|  6;r.. -electron  scattering 

(2)  c?|-^^' -positron  scattering 

(3)  6|fl5|.-pair  creation 

(4)  bj^df^'-ipsiir  annihilation 

More  precisely  the  electron- photon  interaction  is  given  by 

H'jit)  =  {4:7i)hfd^xy){x,  t)ry){x,  t)(p{x,  t) 

where  F  =  y^, ,  and  (p{x)  =^  ^^^{x)  is  the  photon  field. 
Any  y)jc  {x,t)  can  be  written  as 


tpk{x,  t)  =  y)k{x)e- 


i(o,.t 


(50) 


(51) 


(I)  (2) 

Fig.  20.  (1)  aat:  electron  scattering; 
(3)  atftt:  pair  creation; 


(3)  (4) 

(2)  bb't:  positron  scattering; 
(4)  ab:  pair  annihilation. 


This  is  equivalent  to  writing  the  operator  ip  {x,  t)  as 

y){x,  t)  =  e^^^o*y){x)e-'^ot, 


where 

This  implies  that 


ip{x)  =  Jlajcipkix) 

k 


aj^e 


ia^,t_QiH,t^^^-iH^i 


(52) 


(53) 


That  this  is  so  is  easily  seen  by  the  following  argument,  e"*^"^  operating  upon 
the  state  vector  in  momentum  representation  yields  the  factor  e~*^'^  where 
£Ji  is  the  energy  of  the  system.  Removing  this  numerical  factor  we  now  operate 
Q%Hot  QYi  a  state  vector  containing  one  particle  less  corresponding  to  momentum  fe, 
since  aj^  has  operated  on  the  state  vector.  Hence  we  get  the  factor  e^*^^'""**^^ 
so  that  we  get  finally  aj^e'^^^^  which  is  considered  as  the  interaction  represen- 
tation of  the  operator  aj^.  In  (38)  we  have  expressed  Hj{t)  in  terms  of  the  creation 
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and  annihilation  operators  and  single  particle  wave  functions.  In  a  similar  manner 
it  is  possible  to  identify  H'.  For, 

H'j{t)  =  e^'>^H'e-'^o\  (54) 

where  ^ 

H'  =  J  ip{x)  y){x)(p{x)  d^x. 

Substituting  for  H'  in  (31)  we  just  obtain  the  expansion  Ug{t2,  t^)  and  the  correct 
relationship  between  Us{t2,  ij)  and  Ui{t2,  tj). 

Now  we  wish  to  work  with  the  interaction  Hamiltonian  H'j{x)  in  computing 
matrix  elements.  Operating  e"*^"''  on  \iy  yields  the  term  e"*^'"'.  But  when  H' 
acts  on  ]*>,  we  get  a  new  system  of  particles  depending  on  the  structure  of  the 
interaction.  The  total  momentum  of  the  new  system  of  particles  is  identical 
with  that  before  the  operation  of  H\  However  the  energy  of  the  new  system 
corresponding  to  the  total  momentum  is  different  from  the  energy  of  the  initial 
system  for  the  obvious  reason  that  we  are  dealing  with  a  different  system  of 
particles.  Hence  the  operation  of  e*^«^  on  the  new  state  yields  e'^^^'^  where  the 
value  E^  will  depend  upon  the  type  of  particles  in  the  system.  If  we  perform  the 

integration  over  r  we  obtain  the  energy  denominator  ~ =- .  The  above  proced- 

ure  can  be  obviously  extended  to  all  orders  and  we  are  led  to  the  old  fashioned 
perturbation  theory.  This  method  of  computing  the  matrix  element  is  based 
on  the  use  of  the  Schrodinger  representation. 

On  the  other  hand,  if  we  work  with  H'j{x)  and  use  the  commutation  rules 
we  will  arrive  at  the  Feynman  expansion  as  shall  be  demonstrated  in  the  suc- 
ceeding chapters. 

Thus  we  have  introduced  the  concept  of  field  operators  by  demonstrating  the 
necessity  of  expressing  the  interaction  Hamiltonian  in  terms  of  annihilation  and 
creation  operators  and  single  particle  wave  functions.  Our  final  task  is  to  identify 
^0  which  should  be  expressed  in  terms  of  field  operators  such  that  Hq  operating 
on  a  system  of  particles  should  yield  its  energy.  Hq  for  a  fermion  field  may  then 
be  written  as 

Hq=  f  d^xip{x){—iY'  f+  m)y){x),  (55) 

where  ip  {x)  and  yj  (x)  have  already  been  defined  and  y  are  the  well-known  Dirac 
matrices. 

Fermion  fields 

The  field  variable  associated  with  each  point  x  is  given  by 

m  Yl, 


^^^^=    (2. 


7iV^-  J  \  E, 


d^p,     (56) 


r  =  \ 

\\'here  h,h^  and  d,  d^  are  the  annihilation  and  creation  operators  of  the  particles 
and  anti-particles  respectively,  u^ip)  is  the  positive  energy  spinor  and  v^{p)  the 
negative  energy  spinor  with  negative  momentum  — j).  Note  that  ^  (a;)  is  a  destruc- 
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tion  operator  for  particles  and  a  creation  operator  for  antiparticles.  In  the 
Schrodinger  representation  the  commutation  relations  are  given  by 

iwM,  Vix')].  =  IwHx),  rp^x')],  =  0  (57) 

and 

[w(x),y.^Hx')]^=-d(x-x').  (58) 

The  expression  for  the  field  operator  in  the  interaction  representation  is  given  by 

1       r       /  m  \^l    ^ 
and    y,(x)  =  -j^^J  d>  [^j  '\l]^{bl(p)  S'(p)  e'>-  +  dAp)  v'ip)  e"'"-^}.    (60) 

The  anticommutation  rules  are 

[WA^)>  ni^')]+  -  {-V+imlipA{x  -  x';  m)  =  -^*Sf^^(x  -  x') ,  (61) 

where  zl  (ic,  m)  is  given  by  (16)  with  m  in  the  place  of  /n  and  ,^,  jS  represent  the 
components  of  a  spinor.  S^^  is  an  element  of  a  matrix  S  where  S  is  given  by 

S{x  —  x')  =  — (iV-f  m)A{x  —  x';m).  (62) 

It  is  easily  verified  that 

{iW—m)S{x  —  x')=^b.  (63) 

We  notice  that  ip  {x)  and  ip  {x)  can  be  split  up  into  positive  and  negative  frequency 
parts  which  correspond  to  destruction  and  creation  operators.  The  anticommu- 
tation relations  between  these  can  be  easily  written  down  as 

[ip^^^x),  y)^-Hx%  -  [f^-^x),  ip^^Hx')]^  =  0  (64) 

[f^^Hx),  y)^^^{x%  =  [f^~Hx),  y)^-Hx%  =  0  (65) 

and 

W  (^),  V*^"^  (^)l  =  -  i  S':^  (^  -  ^')  (66) 

[y,':>lx),  f;'(x')l  =  -  i  Sl:^{x  -  x') .  (67) 

Real  scalar  {neutral)  field 

Let  (p{x)  represent  the  field  variable  associated  with  every  point  x  in  space. 
Now  for  a  real  scalar  field  (p  =  (p*  and  hence  a;^  =  a|.  Thus  we  have 

'""•)  =  -(2^fY2-iw^"*'"'^+"^""'^^-  '''* 

In  this  form  (p  {x)  clearly  satisfies  the  physical  requirement,  namely,  the  creation 
and  annihilation  are  only  of  particles  of  positive  energy.  Similarly 


^<"'n2-^^ 


O^k 


11/ 
^%*[4e-^''-^-aA;e+*''-«'].  (69) 
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Thus  we  obtain  the  commutation  relations 

[(fix),  (fix')]  -^  [7r(ap),  7r(a?')]  =  0  (70) 

and 

[(p{x),7z(x')]=-  id{x  —  x').  (71) 

In  the  interaction  representation 

1         r   d^k 

a>i>0 

Note  that  CO]^  is  always  taken  as  positive  and  thus  the  frequency  dependence 
of  creation  and  annihilation  operators  are  different.  The  commutation  relations 
in  the  interaction  representation  can  easily  be  worked  out  using  the  commutation 
relations  satisfied  by  a{k)  and  a^{k).  Thus 

+  [a];,a^,]e-*'(^-'-^'-^--^>} 

-    2{27trJ      k,^^  e  J 

ki>0 

=  iA{x-  x') ,  (73) 

'^here 

i        1       rd^k 


-i      1      r(rk,    . 


,-ik-(x-x')  _  ^ik-{x-x'n  /74\ 

kt>0 

This  function  A{x  —  x^)  is  invariant  under  translations  and  homogeneous  Lorentz 

transformations,  j  -^ —  is  an  invariant  quantity  under  Lorentz  transformation.  1 

In  order  to  make  the  covariance  more  explicit,  we  remove  the  restriction  on  k^ 
by  using 


d  {B  -  iM^)  =  -^^  [6  (k,  +  ^,)  +  d(lc,-  a.,)} 


(75) 


Thus 

oc 

A{x  -  x')  =  ^^  [ d^ke-^^^''-'''H{B-  /ju')8{k),  (76) 

(271;)'*./ 

-oo 

where  £  (k)  =  jtjt  and  has  an  invariant  meaning.  We  further  note  that  the  commu- 
tator  will  vanish  for  equal  times  which  also  means  that  the  commutator  will 
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vanish  for  space-like  separations  between  the  two  points  x  and  x'.  Now  (pi{x) 
can  be  spHt  up  into  a  positive  and  a  negative  frequency  part,  i.e. 

(pi{x)  =  (p^{x)  +  (p-{x),  (77) 

where 

kt>0 

and 


^''(•^^=l2^/-y2^<-^^''^-  (^^) 


t     rJk-X 

|/2cO;fc 
A-«>0 

The  following  relations  are  then  easily  obtained. 

[(^^+>  {x),  ^<+>  {x')]  =  [(^(->  (^r),  (^(-)  (a:')]  =  0  (80) 

[(p^^^x),(p^-\x')]  =  iA^^\x  —  x')  (81) 

[(^^->(a;),  ^^^>(a;')]  =  *Zl<-)(a;  — x'),  (82) 


where 


and 


^"^      ''^-  2{2nfJ    k, 

kt>0 

A^-Ux      x')—        ^         f^^    +ik.(x 
^     ^"^      ''^-  2{2nrJ    k,  ^ 


(83) 


(84) 


Defining 
and 


^-4  >   0 

A(x-  x')  =  J(+)  {x  -  x')  +  Zl<->  {x  -  x')  (85) 

—iA^^y{x  —  x')  =  A^'-^{x  —  x')-A^-^{x  —  x')  (86) 

it  can  be  seen  that  A^^^x  —  x^)  is  the  vacuum  expectation  value  of 

[(p  {x) ,  cp  (x')]  ^  =  (p{x)(p  {x')  +  (p  {x')  (p(x). 

In  the  above  consideration,  we  have  assumed  that  the  field  is  described  by  a 
real  field  variable  (p.  However  to  describe  charged  bosons,  the  field  variable 
has  to  be  complex.  But  the  quantization  procedure  is  essentially  the  same. 
Two  types  of  creation  and  annihilation  operators  are  defined  a|,  6|  and  %?  ^a- 
for  positively  and  negatively  charged  particles.  The  total  charge  operator  in 
terms  of  creation  and  annihilation  operators  is 

Q=i  e{alaj^.,  +  cij,  4  -  bl  h,,  —  bj,  bl) .  (87) 

2.  NORMAL  PRODUCTS  AND  WICK'S  THEOREM 

We  shall  now  describe  the  method  of  obtaining  the  matrix  element  </  I'S'I  iy 
from  a  given  initial  state  |^>  to  a  final  state  |/>  using  the  /S-matrix  expansion 
given  before.  We  have  already  explained  why  the  states  |*>  and  |/>  should  be 
expressed  as  creation  operators  operating  on  a  bare  vacuum  and  the  field  operators 
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in  the  aS- matrix  in  the  interaction  representation.  The  initial  state  is  represented 
by  \iy  and  the  final  state  by  </  ],  the  latter  therefore  being  expressed  as  a  sequence 
of  annihilation  operators  operating  on  the  vaccum  to  the  left,  i.e.  on  <  |.  Our 
method  should  be  such  as  to  yield  exactly  the  same  matrix  element  as  is  obtained 
through  the  Feynman  formalism.  That  is,  for  example  in  electrodjoiamic  processes, 
it  should  be  expressed  in  terms  of  the  momentum  representation  of  the  incident 
and  emergent  particles  and  of  the  propagators  corresponding  to  intermediate 
states.  The  integrands  in  the  /S-matrix  can  be  expressed  through  the  field  operators 
and  each  field  operator  consists  of  two  terms,  one  containing  annihilation  operators 
and  the  other  creation  operators.  The  calculation  of  matrix  elements  consists 
in  rearranging  the  terms  in  such  a  manner  as  to  reduce  them  to  a  product  of 
single  particle  propagators  in  configuration  space  and  the  wave  function  corres- 
ponding to  the  incident  and  emergent  particles.  The  integration  with  respect 
to  space  and  time  variables  puts  the  propagators  in  momentum  representation 
and  the  desired  matrix  element  is  thereby  obtained. 

A  single  particle  propagator  connecting  two  space  time  points  implies 
the  creation  of  a  particle  at  one  point  and  annihilation  at  the  other  and  thus  is 
identified  in  the  rearrangement  if  a  creation  operator  with  the  wave  function 
associated  with  it  is  placed  to  the  right  of  an  annihilation  operator  with  the 
corresponding  complex  conjugated  wave  function.  The  only  sequence  of  terms 
which  therefore  does  not  contain  a  propagator  is  one  which  contains  only  creation 
operators  or  only  annihilation  operators  or  all  the  annihilation  operators  to  the 
right  of  the  creation  operators.  Such  a  sequence  is  said  to  be  in  the  "normal 
product"  form.  Thus  it  is  clear  that  given  any  sequence  of  terms,  they  can  be 
rearranged  and  expressed  as  a  product  of  propagators  and  a  normal  product. 
Remembering  that  |^>  and  </ 1  are  represented  as  creation  and  annihilation 
operators  acting  on  the  vacuum,  it  is  easy  to  see  that  the  normal  product  has  terms 
with  annihilation  operators  to  the  right,  and  these  taken  together  with  |*>  on 
the  right  represent  the  incident  particles;  the  terms  with  creation  operators 
in  the  normal  product  taken  with  </ 1  on  the  left  represent  emergent  particles. 
The  incoming  and  emergent  particles  are  represented  by  external  lines  in  the 
Feynman  diagram.  It  is  quite  obvious  that  the  expectation  value  of  a  normal 
product  between  vacuum  states  is  zero  since  an  annihilation  operator  acting  on  a 
vacuum  yields  zero.  We  now  proceed  to  demonstrate  the  mode  of  reduction  of  the 
*S'-matrix  into  propagators  and  normal  products. 

For  this  purpose  we  have  to  introduce  the  Wick's  chronological  operator  T 
and  the  normal  product  operator  N  as  follows : 

T{y){x)y){y))  =  +y){x)ip{y),  x^>y^  (1) 

for  fermions.  Using  the  commutation  relations  for  the  ^'s,  i.e.  [\p,  ip]+  =  0  we 
have  for  all  times 

T{,p{x)^p(y))  =  y^{x)^,{y).  (2) 
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Similarly  we  have 


T(y){x)  tp(y))  =  +xp{x)  xp{y),  x^  >  y^ 


-  —y){y)rp{x),  2/4  > 


(4) 


For  bosons 

T((p{x)(p(y))  =q){x)(p{y),       x^>y^ 

=  +(piy)(p{x),  2/4  >  ^4- 

Thus  the  T-operator  is  identical  with  the  Dyson  P  operator  with  the  addition 
of  a  minus  sign  in  the  case  of  an  interchange  of  fermion  field  operators. 
The  normal  product  operator  N  is  defined  as 

N{y)^^^x)ip^~^{x))  =  -N(^p^-^{y)^p^■'\x)) 
N(yj^^^  (x)  v^(^>  iy))  =  -N(y)^^^  (y)  ip^^^  (x)) 


and 


(5) 


.(+)- 


y)^^^  (x)  y)"^^^  (y)  =  —  yj^-"^  {y)  ^ +>  (x) 


,(+)- 


.(+), 


for  fermions.  For  bosons,  we  have 

N{(p^-\y)  (p^^\x))  =  N((p^^^x)  (p^'Hy))  =  ^^~\y)  (p^''\x) 
The  /S-matrix  is  given  by 


/-v  +  00  n  =  0 


S=  Lt 


n 


iJd^i  .  .  -Jd^XnPiHjix^),  .  .  .,  Hi{Xn))\ 


(6) 


(7) 


(8) 


where  ^j(a;J  =-- yj(a:^)  A(a;^)^(a;„)  in  quantum  electrodynamics.  We  note  that 
there  is  an  ambiguity  in  the  ordering  of  operators  in  the  term  corresponding  to  a 
vertex.  To  remove  this  ambiguity  we  shall  examine  the  meaning  of  the  term, 

ip^^\x)  yj^^\xp 
+  y}^^\x)y)^-^x) 
-i- y)^-\x)  y)^^\x) 
_+ y}^-^ (x)  y)^-^ (x) 


y}{x)yj{x) 


(9) 


Only  one  of  these  terms  has  the  creation  operator  to  the  right  of  the  annihilation 
operator.  Treating  ip^"^^ (x)  y)^-\x)  as  the  limit  of  ^<+^  (?/)  ^<">  (a)  as  y  ^  x,  i.e. 
with  2/4  —  iC4  ->  +  0  we  note  that  the  vacuum  expectation  value  of  ip^^^{x)  yj^'^x) 
is  the  limit  of  a  propagator  which  is  infinite.  On  the  other  hand,  if  we  put  the 
annihilation  operator  to  the  right  of  the  creation  operator,  it  will  not  represent 
the  limit  of  a  propagator.  Therefore  assuming  the  interaction  to  be  represented  in 
this  form  would  imply  that  we  are  taking  it  in  the  normal  product  form.  This 
difficulty  of  the  vacuum  expectation  value  of  the  current  becoming  infinite 
and  the  necessity  for  the  modified  expression  were  realized  by  Heisenberg^  as 
early  as  1934. 


1  Heisenberg,  W.,  Z.  Phys.,  90,  209  (1934);  92,  692  (1934). 
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Thus  taking  the  current  in  the  normal  product  form  as 

j^{x)  -  eN(ip{x)y^,y){x)), 
we  can  write  the  /S-matrix  as 
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(10) 


s=2 

n  =  0 


{-iey 


/  d%i  •  •  •  /  d^Xn 


T{N(ip{x,)  A{x,)ip{x,)) . . .  N(ip{Xn)A{x,,)  y,{Xn))} .       (U) 

Since  ^^  {x)  commutes  with  ip  {x)y^y)  {x)  the  relative  ordering  of  ^^,  {x)  and^  (^)yfiW(^) 
is  irrelevant  and  hence  we  have  included  the  A^  also  in  the  normal  product. 

We  shall  first  prove  that  a  jT-product  for  two  operators  can  be  uniquely 
decomposed  into  a  normal  product  and  a  propagator.  We  have 


^(^«  (^)  y>p  iy)  —  ^  iwcc  {^)n  iy) )    ^^^    ^4  >  2/4 


wii^)f}iyy 
+vi{^)niy) 

+  y)~{x)y)^{y) 

+  w^i^)niy). 


N 


+ vtc  (^)  n  (y) 

+  Va{x)y)^{y) 


(12) 


=  [rA^),  niy)l  =  -^  ^Ui^-y)  ■    (i3) 


wl{x)fHy)l    r   wii^)y^piyy 

+  '(pl{x)fpp{y)     _     -y)p{y)iptc{x) 

+  yj-{x)ip^{y)  +W~A^)wtiy) 

_-\-y)l{x)ipp{y)\      V  +  ip-Ax)T^{y). 

Por  y^  >  x^ 

T(wA^)ipp{y))  -  N(y^^{x)ip^{y))  ^  -T(tp^(y)xp^^(x))  -  N(y,,(x)ip^{y)). 

y^Hy)'^i(^)l     r   wi{x)9p{yr 
+wMy)w^i^)  _   +w^{^)wMy) 
+niy)vt{^)        -n{y)Wa{^) 
+ w  iy)  w^  (^)  J     L + ^«  (^)  v'iS  (2/). 

=  -  [v^^  iy),  wl  (^)]+  -  [wt  (2/)'  v^«  (^)]+ 
-te(2/).  ^«W]+ 

=  +  *>S;^(a;-2/). 
Thus  for  all  times  we  have 

T(wAx)xpp{y))-^iWo.{x)y^^{y))  =  -"-U^Fapi^  -  y)- 

EPCR      9 


(14) 


(15) 
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The  difference  of  the  T  product  and  the  N  product  is  defined  as  the  contraction 
Wa.  (^)V'i8  {y)-  This  has  been  shown  to  be  the  propagator.  Similarly  defining  the 
contraction  for  bosons  as  cp'  {x)  (p'  (y),  we  can  show  that 

(p'{x)(p'{y)  =  {i)A^{x-y)     for     x^>y^  (16) 

=  {-i)A-{x-y)     for     y^>  x^  (17) 

=  \  Af{x  —  y)     for  all  times.  (18) 

We  must  now  prove  that  by  the  application  of  this  rule  the  entire  T  product 
of  a  number  of  operators  can  be  decomposed  into  normal  products  and  propa- 
gators. This  is  known  as  Wick's  theorem  which  states  that  a  T-product  can  be 
decomposed  into  a  unique  sum  of  normal  products  of  the  form: 

T{UV  ...,XYZ)  =  N{UV  ...,XYZ)  +  N(U'V'  ...,XYZ)         (19) 

+  .-.  +  i\^(t7-F"...,X-7Z*)  +  .... 

where  the  following  notation  has  been  used ;  the  first  term  represents  the  normal 
product  of  all  the  operators,  the  second,  the  normal  product  of  operators  in 
which  any  one  pair  has  been  contracted,  the  third  in  which  two  pairs  are  con- 
tracted, and  so  on  until  all  possible  combinations  are  exhausted.  The  proof 
follows  from  induction. 

Proof  of  Wick's  theorem 

To  prove  the  theorem  by  induction,  we  note  that  it  is  true  in  the  case  of  a  single 
factor  and  also  for  two  factors  as  in  the  latter  case  it  merely  gives  the  definition 
of  the  contraction  of  two  operators  in  terms  of  the  T  and  N  products  (15).  Let  us 
assume  that  it  holds  for  n  factors.  Multiplying  the  left-hand  side  of  (19)  on  the 
right  by  an  operator  Q  labelled  with  a  time  earlier  than  any  other  factor,  we  have 

T(UV  .  ..XYZ)Q=^  T{UV  .  .  .XYZQ)  (20) 

The  terms  on  the  right-hand  side  of  (19)  will  be  of  the  form 

N{UV  .  .  .XYZ)Q. 
Now  if  we  are  able  to  decompose  each  one  of  these  in  the  following  way, 
N(UV  .  .  .XYZ)Q=  N{UV  .  .  .XYZQ) 
+  N(UV  .  ..XYZ'Q') 
+  N{UV  ...XY'ZQ') 
+  .  .  . 
+  N(U'V  ...XYZQ')  (21) 

then  the  sum  of  the  terms  on  the  right-hand  side  so  obtained  will  evidently 
correspond  to  the  right-hand  side  of  (19)  written  for  t^  +  1  factors,  if  ^  is  earUer 
than  UV...X  YZ.  (This  restriction  is  however  removed  once  Q  is  inside  the  T 
and  N  products  since  these  remain  by  definition  unchanged  for  the  same  re- 
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grouping  of  factors  on  both  sides  of  the  equation.)  Thus  we  will  have  proved 
Wick's  theorem  if  we  prove  (21).  For  this  purpose,  we  again  have  to  take  recourse 
to  induction.  (21)  is  true  for  a  single  factor  in  N  since  then  it  is  merely  the  de- 
finition of  contraction  of  two  operators  given  earlier.  Let  us  assume  now  that  it  is  true 
for  n  factors,  that  ^  is  a  creation  operator  and  that  all  the  operators  UV...  XYZ 
are  annihilation  operators.  li  Q  is  an  annihilation  operator  and  UV  .  .  .  XYZ 
contain  both  creation  and  annihilation  operators,  then  (21)  would  become  a 
trivial  identity  since  the  contraction  of  two  annihilation  operators  is  zero  as  also 
that  of  an  annihilation  operator  whose  time  label  is  earlier  than  that  of  the 
creation  operator  with  which  it  is  contracted.  Further  if  Q  is  a  creation  operator 
as  we  have  assumed,  we  may  add  any  number  of  creation  operators  on  the  left 
of  all  the  factors  inside  the  N  products  on  both  sides  of  equation  (21)  and  the 
theorem  would  still  be  valid  since  the  contraction  of  two  creation  operators  is 
zero.  Thus  our  assumption  that  jQ  is  a  creation  operator  earlier  than  that  of 
any  of  the  annihilation  operators  UV  .  .  .  XYZ  is  sufficient  for  our  purpose. 
Multiplying  (21)  by  an  annihilation  operator  D  which  has  a  time  later  than  Q 
on  the  left,  we  get 

DN{UV  .  .  .XYZ)Q=  N{DUV  .  .  .XYZ'Q')-^  .  .  .  +  N{DU'V  .  .  .XYZQ') 

+  DN{UV  ...XYZQ).  (22) 

Further  since 

DN(UV .  .  .XYZQ)  =  dp  DQU  .  .  .XYZ  (23) 

(dp  being  the  sign  of  the  permutation  for  fermion  factors)  and 

DQ  =  T(DQ)  -  D'Q'  +  N{DQ)  -  D'Q'  +  SqQD,  (24) 

(where  ^q  =  di  1  respectively  for  boson  and  fermion  factors) 

DQ  UV  ...  XYZ  ==D'n'UV  ...XYZ  +  SqQD  UV  ...XYZ 

=  dp  N{D'  U  V  .  .  .  XYZQ.)  +  dQdpN{D  UV  .  .  .  YZQ) ,  (25) 

so  that  we  have  finally 

DN{UV  -XYZQ)  =  d^DQUV-   XYZ 

=  d%N  {D' U  V"  -  XYZ  Q)  +  dlS%N{DUV'-- XYZQ).      (26) 

Noting  that  Sq  ^  dp  =  1,  we  see  that  the  theorem  (21)  is  true  for  w  4-  1  factors 
and  hence  by  induction  is  true  in  general. 

A  mixed  T  product  is  defined  to  be  a  T  product  in  which  normal  products 
occur.  We  have  such  mixed  products  in  the  terms  of  the  >Sf-matrix.  Wick's^ 
extension  of  his  theorem  to  these  products  states  that  they  can  also  be  decom- 
posed into  a  unique  sum  of  normal  products  as  in  (19)  provided  we  do  not  contract 
factors  which  are  already  in  normal  product  form. 

To  prove  the  theorem  we  first  observe  that  the  factors  inside  each  of  the  normal 
products  occurring  in  the  mixed  products  of  the  /S^-matrix  expansion  bear  the 

ime  time  label  as  mentioned  earlier.  Let  us  split  each  of  the  operators  ip,  y> 

1  G.  C.  Wick,  loc.  cit. 
9* 
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and  A  occuring  in  the  mixed  T  product  into  creation  and  annihilation  operators 
and  consider  the  mixed  T  product 

T(N.{ABC).  .  .N{XYZ)), 

where  ABC,  XYZ  are  either  creation  or  annihilation  operators,  ABC  having  the 
same  time  label  and  similarly  XYZ.  We  can  think  of  it  as  the  limit  of  T  (ABC  .  .  . 
XYZ),  where  the  time  label  of  the  creation  operators  amongst  ABC  .  .  .  XYZ  is 
assumed  to  be  later  than  that  of  the  destruction  operators  by  an  infinitesimal 
amount.  We  can  now  apply  the  theorem  (19)  and  find  that  all  contractions  amongst 
the  operators  ABC  .  .  .  etc.  vanish  (since  the  creation  operators  have  a  later  time 
than  the  annihilation  operators)  which  is  what  the  theorem  states. 

3.  REPRESENTATION  OF  INTERACTIONS  IN  FIELD  THEORY 

We  have  seen  how  the  interaction  between  particles  in  the  field  theoretic 
formalism  is  expressed  through  the  inclusion  of  an  interaction  term  in  the 
Lagrangian  or  Hamiltonian  density.  This  term  representing  the  interaction  density 
at  a  space-time  point  x  consists  of  a  product  of  field  operators  corresponding  to 
the  same  space-time  point  x  (neglecting  for  the  moment,  numerical  factors, 
differential  operators  and  matrices)  or  in  other  words  the  interaction  is  local  and 
represents  the  creation  and  annihilation  of  particles  at  the  vertex  x.  The  inter- 
action between  two  particles  when  they  are  separated  by  a  finite  distance  is 
interpreted  as  due  to  the  action  of  a  mediating  field,  the  quanta  of  which  are  ex- 
changed between  the  two  particles,  i.e.  emitted  by  one  and  absorbed  by  the  other. 
In  this  picture,  the  basic  interaction  is  only  between  the  particle  and  the  quanta 
a-nd  is  therefore  local.  The  advantage  of  postulating  such  local  interactions  lies 
in  the  fact  that  the  interaction  term  in  the  Lagrangian  can  be  made  relativistically 
invariant.  This  condition  and  other  considerations  of  symmetry  and  invariance 
impose  restrictions  on  the  nature  of  the  coupling  between  the  various  fields. 
The  strength  of  the  interaction  is  measured  by  the  magnitude  of  a  numerical 
multiplicative  factor  g,  the  coupling  constant. 

It  is  customary  to  divide  interactions  into  two  groups  according  to  the  natiu'e 
of  the  coupling  between  the  fields :  (1 )  the  direct  coupling  in  which  the  field  operators 
occur  as  such,  and  (2)  the  coupling  in  which  the  derivatives  of  the  field  operator 
occur.  The  actual  interaction  term  should  be  hermitian  and  should  have  definite 
transformation  properties  which  necessitates  the  introduction  of  suitable  matrices. 

Direct  couplings :  Since  the  only  known  particles  are  either  bosons  or  fermions, 
the  most  general  term  for  the  interaction  (ignoring  the  y-matrices,  the  choice 
■of  which  depend  on  the  transformation  properties  of  the  fields)  is 

g{ipy))^(p^ 

where  F  and  B  denote  the  number  of  fermions  and  bosons  at  a  single  vertex. 
The  fermion  field  ip  occurs  in  a  bilinear  manner  since  fermionS  are  emitted  or 
absorbed   only  in  pairs.  The   coupling  constant  g  giving  the  strength  of  the 
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interaction  should  preferably  be  a  dimensionless  number.  In  a  general  type  of 
interaction  this  need  not  necessarily  be  true.  To  determine  the  dimensions  of  g^ 

we  find  that  since 

[m^9?^]  =  iy  *     and     \my)ip]^L~^ 

[(p]  =  L-^     and     [xp]  =  L-'l- 
(in  units  of  ^  =  c  =  1) 
we  have  {g^L'^'^^  =  L~^ , 

i.e.  [^]=:L-4+^+3i^, 

The  tables  give  the  dimensions  of  g  for  various  processes  and  the  invariant  inter- 
action term  for  the  various  transformation  properties  of  the  cp  field.  The  combi- 
nations of  the  y-matrices  for  direct  and  derivative  couplings  are  different  due  to 
the  introduction  of  the  differential  operator.  The  coupling  constant  /  in  the  latter 
case  has  to  be  divided  by  a  mass  //  to  make  it  dimensionless. 


Symbol 

B 

F 

-4  +  B  +  SF 

tptptp 
<p^ 

1 
3 

1 
0 
0 

0 
0 

-1 

ipy)(p^ 

(v  w)^  (P 

2 

0  1 

1  ! 

1 

2 

2 

+  1 

2 
3 

Transformation 

The  invariant  interaction  term 

of  qp  field 

Direct  coupling 

Derivative  coupling 

Scalar 

Pseudoscalar 

Vector 
Pseudovector 

gyjyxp 
gipy^tpcp 

/    -            Q<P 
if                   dw 

4.  THE  HEISENBERG  REPRESENTATION  AND 
OPERATORS  IN  FIELD  THEORY 

The  Heisenbeig  representation^ 

In  the  Schrodinger  representation,  the  spatial  coordinates  are  treated  on  a 
different  footing  from  the  time  coordinate,  whereby  the  manifest  relativistic 
in  variance  of  the  theory  is  lost.  In  many  cases  however  it  is  useful  to  employ 

^  This  chapter  is  largely  based  on  Kallen's  article  in  Handbitch  der  Physik,  Vol.  5,  Part  I, 
Springer- Verlag  (1958).  Since  it  is  impossible  to  improve  upon  the  terseness  and  accuracy 
of  Kallen's  style,  the  author  has  attempted  to  adhere  strictly  to  his  sequence  of  ideas.  Here 
we  are  using  the  symbol  tp  (t)  instead  of  c  (t)  as  we  should  in  conformity  with  the  earlier  notations 
of  the  previous  chapter.  However  we  prefer  in  this  chapter  to  retain  rp  (t)  since  we  are  following 
closely  Kallen's  treatment,  in  which  the  notation  xp{t)  has  been  adopted. 
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the  relativistic  invariance  to  prove  general  theorems  and  under  certain 
circumstances  it  may  be  necessary  to  use  such  invariance  considerations  to 
give  a  meaning  to  quantities  which  are  mathematically  undefined.  In  such 
cases  it  is  more  suitable  to  use  the  Heisenberg  representation  in  which  there  is 
complete  symmetry  in  all  the  four  coordinates. 

The   Heisenberg  representation   of  a  state  vector  Wji  is  obtained  from  the 
Schrodinger  representation  Wg  by  the  transformation 

W^(t)^ei«tWs(l),  (1) 

where  H  is  the  total  Hamiltonian.  Since  the  Schrodinger  equation  for  ^^(0  has 
the  solution  ¥^^(0  =  e~^^^W(0),  we  recognize 

¥'hW  =  'Ps(0)  (2) 

for  all  t;  i.e.  the  Heisenberg  state  vector  is  independent  of  time  and  coincides 

with  the  Schrodinger  state  vector  at  ^  ==  0.  Hence  it  obeys  no  equation  of  motion. 

However,  the  Heisenberg  representation  Fjj  (t)  of  an  operator  Fg  in  the  Schrodinger 

representation  is 

FhH)  =  e'"'  Fse-'"'.  (3) 

Differentiating  both  sides  with  respect  to  t,  we  have 


dt 


=  i[H,FH],  (4) 


which  can  be  interpreted  as  the  equation  of  motion  of  the  operator  Fjj .  In  parti- 
cular if  we  choose  the  operator  F  to  be  the  Hamiltonian  H,  we  find  that  the 
Hamiltonians  in  both  the  representations  are  identical  and  time  independent. 

The  definition  of  the  transformation  implies  the  choice  of  zero  as  the  point 
where  the  two  representations  coincide.  More  generally,  any  point  can  be  chosen 
for  such  coincidence  and  in  collision  problems  it  has  become  customary  to  choose 
a  point  which  is  made  to  tend  to  —  oo.  This  limiting  process  obviously  introduces 
mathematical  difficulties  but  the  physical  interpretation  is  made  easier  if  the 
two  representations  are  made  to  coincide  "initially,"  and  as  has  been  shown  in  the 
chapter  on  the  formal  theory  of  scattering,  it  is  convenient  to  interpret  "initial" 
as  corresponding  to  the  point  tj^  =  —  oo. 

It  can  be  readily  established  that  if  we  set  up  a  field  theory  derived  from  a 

Lagrangian  density  function  of  the  type^  Lycpcc  {^)^  — ^ )>  then  the  Heisenberg 

equations  of  motion  for  the  field  operators  (p^  {x)  and  the  conjugate  momentum 
operators  7r^(a;)  are  identical  with  the  equation  defining  n^{x)  in  terms  oi  (p^{x), 
i.e.  r 


^  From  now  on  in  this  chapter  we  drop  the  suffix  H  since  we  will  be  working  only  in  th© 
Heisenberg  representation  unless  otherwise  specified. 
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and  the  Lagrangian  equation  of  motion 


y     S  dL  dL 


//=!  dx, 


respectively,  provided  the  latter  is  considered  as   an  operator  equation  and  we 
impose  the  canonical  commutation  relations 

[n^{x),  (pp{x')\  =  —i  d^^d{x  -  X') 

KM,  ^)5(^')]  =  b«(^),  9^y3(^')]  =  0,  (7) 

which  are  considered  postulates  of  the  theory.  Further  if  as  in  classical  theory, 
the   three  spatial    components  of  the  momentum  are  defined  by  the  equation 


Pic^-U^xZn^ix) 

J  a 


^ff.  (^] 


(8) 


dxj, 
then  it  follows  for  any  operator  F(x)  constructed  from  (p^(x)  and  7i^{x)  that 

This,  together  with  equation  (4)  forms  the  complete  system  of  equations  for  the 
operators  in  the  Heisenberg  representation.  If  we  set  P^  =  H,  then  the  four 
equations  can  be  written  as 

i[P,,Fix)]='^.  (10) 

P^,  is  called  the  space-time  displacement  operator  for  obvious  reasons. 

The  operator  equation  is  equivalent  to  stating  that  the  matrix  element  between 
two  states  of  i  \P^ ,  F  (x)]  is  equal  to  the  differential  coefficient  with  respect  to 
the  variable  x^^  of  the  matrix  element  ofF(x)  between  the  two  states.  In  particular,, 
if  we  choose  the  two  states  |a>  and  \by  with  definite  four-momenta  ^Jf^  and  ^Jf^ 
respectively,  we  get 

i<a\[P^{x),F{x)]\by  =  i{pl^^-p<^^)<a\F{x)\by  =  -—<a\F{x)\by         (11) 

cx^ 

the  solution  of  which  is 

<aiJP(^)|6>  =  <a|J'(0)]6>e'f '^^'"~^^'"''^''.  (12) 

Thus  we  obtain  a  relationship  between  the  matrix  elements  of  the  operators  F  [x) 
and  F{0)  or  more  generally  ^(a;^)  andP(a;2)  respectively.  This  principle  of  trans- 
lational  in  variance  of  a  Heisenberg  operator  is  used  extensively  in  calculations. 
We  shall  now  study  the  problem  of  the  interaction  of  the  electromagnetic  field 
with  a  given  classical  current  distribution  ^^  {x)  as  this  will  enable  us  to  introduce 
certain  concepts  and  definitions.  From  the  Lagrangian  for  this  problem 

i=-l^,.^,,_iM.^  +  ^.,;,  (13) 
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where  the  first  two  terms  of  the  right-hand  side  form  the  Fermi  Lagrangian  for 
the  free  electromagnetic  field  and  the  last  term  represents  the  interaction 
Lagrangian,  we  obtain  the  equations  of  motion  for  the  field  operators  A^^  as 

O  A^,(x)  =  j^{x)  (14) 

and  the  Hamiltonian  operator  as 

''^'I^M^^^'A  ^]  -/d3./.(x)^„(x).  (15) 

The  solution  A^^  (x)  can  be  written  in  the  form 

A^{x)  =  A'j^^x)  +fDjt{x  -  x')  j,{x')  dx' ,  (16) 

where  A^^\x)  is  just  the  initial  value  of  A^X^),  i.e.  at  ^  =  —  oo,  and  it  is  assumed 
that  j^{x)  vanishes  as  ^  ->  —  oo  in  such  a  manner  that  the  integral  in  (16)  exists^. 
A^^^{x)  satisfies  the  homogeneous  equation 

U\Af^{x)  =  0.  (17) 

Since  the  integral  is  a  c-number,  A^^^  {x)  satisfies  the  same  commutation  relations 

as^^(a;): 

[Af\x),  A^^){x')\  =  -id^,D{x'-x)  (18) 

<0|{4o)(a:),  A^^^x')}\^y=:^d^,D^{x'-x).  (19) 

A^^^(x)  is  called  the  "incoming"  field.  On  calculating  the  energy  with  the  above 

solution  and  assuming  that  the  current  vanishes  also  at  ^  ->  +  oo,  it  can  be  shown 

that 

lim^  =  ^o(J(?)).  (20) 

If  we  choose  the  state  vector  at  t  =  — oo  to  be  the  eigenfunction  of  HQ{A^^),  the 
state  vector  has  particles  corresponding  to  the  "incoming"  free  particles.  Since  we 
have  assumed  that  j^i{x)  vanishes  at  ^  =  +  oo,  we  can  also  write  the  solution  in  the 
form 

A^{x)  =  Al^){x)  +  j  DA{x-x')j^{x')dx'  (21) 

where  the  operator  ^Ijf ^  [x)  corresponds  to  the  value  of  A^  {x)  at  ^  =  +  oo  and 
also  satisfies  the  homogeneous  equation 

UA^^\x)  =  0.  (22) 

As  for  ^0,  if  it  is  expressed  in  terms  of  A^^\x),  as  ^  ->  +  ^  the  eigen  functions 
of  this  would  have  free  particles  corresponding  to  the  "outgoing"  state.  Since 
we  are  interested  only  in  transitions,  the  state  vectors  for  the  "incoming"  and 
"outgoing"  cases  would  be  different.  From  their  very  definitions,  the  relation 
between  the  incoming  and  outgoing  fields  is  given  by 

A^^\x)  -  A^^\x)  =  JD{x-  x')  j^{x')  dx'.  (23) 

^  For  an  alternative  definition,  see  chapter  II,  section  1. 
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In  the  above  discussion,  we  have  assumed  j^^  to  be  a  classical  current  but  when 
we  are  interested  in  a  quantized  charged  fermion  field,  then  j^  (x)  will  have  to  be 
expressed  in  terms  of  fermion  field  operators. 


Interaction  of  fermion  and  electromag'netic  fields 

We  write  the  Lagrangian  for  this  system  as 

L=L^  +  L^  +  L,^^,  (24) 

where  Ly,  and  L^  are  the  free  field  Lagrangians  for  the  two  fields  and  Zj^t  is  given  by, 

Lmt  =  Ai^)Ju{^)  (26) 

g 

the  fermion  current  in  this  case  being  the  commutator,  -^  [ip,yf^ip]. 
The  corresponding  equations  of  motion  are 


l  —  iy ^  m\  ip{x)  =  e  y  A  {x)  y) {x) ^ 

n  A^ (x)  =  —  [ij)(x),y^y) (x)]  ^ j^ (x) . 


(26) 
(27) 


Since  the  term  L■^^^^_  does  not  contain  the  time  derivative  of  the  field  operators, 
the  canonical  momenta  will  be  the  same  function  of  the  dynamical  variables  as 
before  and  the  commutation  relations  for  equal  times  can  be  written  down  im- 
mediately : 


\A^{x),  A^{x  )\x^  =  x\  — 


dAf^     dAf^ 


dxA 


S< 


0 


d  x^ 


X4=X, 


Ai^') 


d^^d{x  —  x') 


{ip{x),  y){x')}^^^^'^  =  y^d{x-  x') 


(28) 

(29) 

(30) 
(31) 


[A^{x),y){x%^^^'^ 
[A^{x),ip{x%^^^'^ 


^     ,  y>(x') 


=  0 


-a^-'  ^'^'> 


=  0. 


(32) 
(33) 


dx^ 

dA^ix) 

While  the  commutator  for  arbitrary  times  for  free  fields  can  be  obtained  from  the 
commutator  at  equal  times,  this  cannot  be  done  if  we  include  the  interaction 
since  the  commutators  do  not  satisfy  simple  equations  of  motion.  The  differential 
equations  of  motion  can  be  transformed  into  corresponding  integral  equations 

y,{x)  =:Y'>\x)  -  j  SR(x  —  x')eyA{x')y){x')  dx'  (34) 


A^{x)=^Af){x)  +jDn{x-xnj[ip{x'),  y^y^{x')]d. 


(35) 


6 

^  Here  we  are  using  rationalized  units  of  charge,  — —  =  Via?* 
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^(0)^  ^(0)  aj^(j  ^(0)^  ^(0)  are  identified  with  the  "in"  and  "out"  fields  respectively. 
It  is  to  be  noted  that  in  the  previous  case,  since  we  had  assumed  a  classical 
current  j^ ,  (16)  represented  a  solution  for  Af,{x)  while  in  this  case  A^^  and ^  satisfy 
coupled  integral  equations  which  are  yet  to  be  solved.  The  solution  is  not  very 
easy  and  suitable  perturbation  methods  should  be  devised.  It  is  interesting  to 
note  that  the  perturbation  theory  here  applies  to  operators  while  in  the  Schro- 
dinger  and  interaction  representations  we  were  applying  it  to  the  state  vectors. 
The  most  obvious  way  of  applying  perturbation  theory  to  such  integral  equations 
is  to  write  ip  (x)  and  A  (x)  as 

y){x)  -  tp^^^x)  +  ey)^^\x)  -f-  e^tp^^^{x)  +  .  .  .  (36) 

A^  {x)  =  ^0)  {x)  +  e  ^(1)  (x)  +  e2  ^(2)  (x)  +  . . . .  (37) 

By  substituting  these  in  the  equations  of  motion,  we  obtain  the  recurrence 
relations  for  the  different  approximations  as 


y,n^^^)  =  -1/^  fSnix  -  x')  y,  Z{A^r\^)\  ^p^''-'^\x')\^x 

J  TO  =  0 


(38) 


/n 
Dr{x-x')  JJIV^^'^H^'),  y^w'''-'^Hx')]dx' .  (39) 

m  =  0 

(In  the  first  of  these  expressions,  we  have  symmetrized  the  right  hand  side  which 
is  permissible  since  the  operator  ^^,(a:)  and  ^(a;)  or  y){x)  commute  for  equal  times.) 
In  particular  we  have 

y)(i){x)  =fSR{x  —  x')yA^'^\x')y)''{x')dx'  (40) 

ip^^\x)^fip^^\x')yA''{x')S^{x-x')dx'  (41) 

A^^^x)  =  ilDR{x  —  x')[y)^^\x'),  y^ip^^^x')]dx' .  (42) 

The  general  expression  for  tp{x)  or  A^{x)  is  very  complicated.  We  must  note  the 
occurrence  even  in  the  first  approximation  ip^^\x)  of  the  product  A^^\x)ip^{x) 
which  impHes  transition  from  the  vacuum  to  a  two  electron  state 

iO\y){x)\q,  ky  =  e  f  Se{x  -  x')  y,<^0\A(^Hx')\ky<:0\y)^^\x')\qy  dx'  +  ' ' ' .      (43) 

This  is  a  peculiar  feature  of  coupled  fields  which  is  not  present  in  the  case  of 
free  fields  where  the  operators  applied  on  the  vacuum  produce  only  states  with 
one  particle.  Matrix  elements  for  products  of  operators  can  be  simplified  by  intro- 
ducing intermediate  states  and  summing  over  them.  For  example, 

<0 1^(0) (^)  1^(0) (^)^  Ai^J{x')}\qy=  2;<0\yj^'H^)\^><^\{A^J:Hx),  4?(^')}k>-      (44) 

z 

The  nature  and  number  of  the  intermediate  state  are  restricted  by  the  nature 
of  the  operators.  (In  the  above  example,  the  matrix  element  has  a  non- vanishing 
value  only  if  the  intermediate  state  is  a  one-electron  state.) 
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We  know  that  the  "in"  and  "out"  fields  satisfy  the  same  canonical  commu- 
tation relations.  Hence  there  must  exist  a  unitary  matrix  8  such  that^ 

w^''\x)=S-^y^^{x)S  (45) 

A^^^{x)=S-^A^;i'{x)S  (46) 

S^S  =  SS'=:l.  (47) 

If  we  assume  as  before  that  the  state  vector  at  ^  =  —  oo  is  an  eigen  function  of 

H  0  (in)  and  at  ^  =  +  oo  that  of  H^  (out)  where 

^(O)(^(out)^  ^(out))  _  ^-1^(0)  (^(in)^  ^(in))  ^  (4g) 

then  it  is  clear  that  the  probability  that  the  state  \ny  goes  over  to  \n'y  by  the 
interaction  is  given  by 

\<n'\S\ny\^. 

This  merely  implies  that  the  transformation  matrix  S  is  just  the  >S'  matrix  of 
collision  theory  with  the  operators  in  the  Heisenberg  representation.  In 
calculations,  it  has  been  found  more  convenient  to  work  with  the  S  matrix  in 
the  interaction  representation. 

Propagators  in  field  theory 

In  the  foregoing  methods  of  computation,  we  have  used  the  momentum 
representation  of  the  initial  and  final  states  since  we  are  interested  only  in  the 
matrix  elements  for  transition  between  definite  momentum  states.  If  we  wish  to 
use  the  configuration  representation  for  the  initial  and  final  states,  then  we  are 
constrained  to  use  a  kernel  function  connecting  the  space-time  points  associated 
with  the  wave  functions  of  the  initial  and  final  particle  systems  and  integrate  out 
over  these  space -time  points.  This  process  should  be  identical  with  that  using 
initial  and  final  states  in  momentum  representations  and  the  /S-matrix  expansion. 
In  defining  propagators  in  field  theory  it  is  necessary  to  remember  the  correspon- 
ding concept  in  the  Feynman  formalism  where  we  were  dealing  with  propagators 
of  single  particles,  and  the  propagators  of  systems  of  particles  were  defined  in 
terms  of  these,  as  for  example  the  two  particle  propagator  in  the  presence  of  an 
interaction  or  the  modified  propagator  of  a  single  particle  assuming  interaction 
(self  field).  In  field  theory  however  we  can  only  speak  of  propagators  connecting 
space-time  points  at  which  the  particles  are  annihilated  and  those  at  which 
particles  are  created. 

Before  obtaining  a  general  expression  for  the  kernel  or  propagator  let  us  take 
the  case  when  the  initial  system  consists,  say,  of  a  fermion  and  a  boson  of 


C.  N.  Yang  and  D.  Feldman,  Phys.  Rev.  79,  972  (1950). 


140  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

momenta  p  and  k  respectively  and  is  represented  by  a^ {k)h^ {p)^^.  We  first 
recognize  that 

a^k)h^'p)\  =  A  j  ip{x)(p{x)u{p)Q-'^^"'^^'y^dixdy  (49) 

where  xp  and  cp  represent  the  fermion  and  boson  field  operators  respectively  and  A 
the  normalization  factor.  Similarly  for  the  final  system  we  can  write 

<]b{p')  a(k')  =  A  jip{x')  (p{y')u{p')  e+^^^'-^'  +  ^-'-^'M^'  d^y' .  (50) 

In  operating  the  /S-matrix  on  the  state  a^  {k)b'^  {p)yQ  the  annihilations  of  the 
particles  with  momentum  p  and  k  may  occm*  at  any  one  of  the  vertices  in  the  r^-th 
order  term  of  the  /S-matrix  expansion.  This  circumstance  is  taken  into  account 
when  using  the  wave  functions  in  configuration  representation  by  defining  a 
propagator  as 

G{x',  X,  y',  y)=-iT{xp{x')  q^{y')Sip{x)  cp{y))y,  (51) 

c 

where  S  implies  the  /Sf- matrix  expansion  and  the  T- operator  operates  only  after 
explicitly  writing  the  /Sf-matrix  expansion,  i.e.  on  the  entire  aggregate  of  terms 
y)(x')(p(y')y){x)(p(y)  and  the  terms  in  the  integrand  of  the  n-th.  order  term  of  the 
^S-matrix.  c  is  the  normalization  factor  defined  as  QiS^Q. 

Thus  integrating  with  respect  to  the  space-time  points  corresponding  to  the 
arguments  of  the  operators  of  the  propagator,  i.e.  of  the  wave  functions  of  the 
initial  and  final  systems,  merely  implies  taking  the  momentum  transform  of  the 
propagator  with  arguments  corresponding  to  the  momenta  of  the  initial  and  final 
states.  Thus  the  propagator  is  a  four-point  function  involving  two  fermion  and 
two  boson  lines  and  therefore  can  be  used  for  obtaining  transitions  not  merely 
from  an  initial  fermion  plus  boson  state  to  a  final  fermion  plus  boson  state  but 
also  for  two  other  processes,  viz.  fermion  +  anti-fermion  ->  2  boson  states  or 
vice  versa.  (This  is  implied  in  the  use  of  the  operators  ^  and  y)  in  the  propagator.) 
Thus  the  matrix  element  for  all  these  processes  is  obtained  from  the  same  Green's 
function  by  integrating  over  suitable  initial  and  final  wave  functions. ^  This 
circumstance  imposes  certain  conditions  on  the  matrix  element  when  viewed  as 
the  boundary  value  of  an  analytic  function.  This  is  the  starting  point  of  the  theory 
of  dispersion  relations  which  will  be  discussed  in  a  later  chapter. 

We  define  the  function  G{x' ,  x)  as: 

-oo  -oo 

We  shall  examine  the  meaning  of  the  individual  terms  in  the  expansion  oiG{x' ,x) 
and  the  normalization  factor  c  in  terms  of  Feynman  diagrams.  At  every  vertex 
corresponding  to  a  H^^^^{x)  there  is  only  one  photon  line  and  as  O  represents 


^  See  for  the  derivation  of  a  differential  equation  for  G{x,  x'),  J.  Schwinger,  Proc.  Nat. 
Acad.  Sci.  37,  452  (1951).  See  also  N.  N.  Bogoliubov  and  D.  V.  Shirkov,  loc.  cit.,  pp.  416. 
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only  two  electron  lines,  it  is  obvious  that  the  number  of  vertices  should  be  even. 
The  numerator  is  thus  given  upto  the  fourth  order  as: 

iT(ip{x')lp  (x))  >o  (zeroth  order) , 

iT  (y)(x')^{x)  W^'^x^)  H'''\x^))yo  (second  order)  }     (53) 

and   <T(y){x')  y}{x)  H^\xj)  H^^'^ix^)  H'^'^x^)  IP''^{x^))yQ      (fourth  order) 

the  last  two  being  represented  by  Figs.  21a  and  21b  respectively. 


X'        Xp  X,        X 


X     X4  X3   X2  X,    X  X'   X^      X3  X2      X|       X  X'    X4   X3 


;0< 


X'  X4 


X'  X  x"^  *  X 

X^F ^   X,  X^r- -;,   X, 

x|^ ^  X2  xr- ^  X2 


Fig.  21.  Diagrammatic  representation  of  the  numerator. 

The  terms  within  brackets  are  vacuum  to  vacuum  transitions  through  virtual 
creation  and  annihilation  of  pairs.  The  denominator  which  is  o<'S^>o  is  given  up  to 
the  fourth  order  as 


D=l  + 


\            / 
\             / 

'            \ 
/               \ 

+ 

/    \ 
/             \ 

Fig.  22.  Diagrammatic  representation  of  the  denominator. 
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i.e.  a  sum  over  all  possible  vacuum  fluctuation  terms.  It  is  to  be  noted  that  the 
vacuum  cannot  be  connected  to  any  other  state  by  the  >S-matrix.  This  implies 
that  if  we  do  not  have  any  initial  particle,  there  will  not  be  any  emergent  ones, 
i.e.  the  vacuum  is  an  eigenstate  of  the  /S-matrix.  The  unitarity  of  the  /S-matrix 
requires  the  eigenvalue  to  be  unimodular  so  that  we  can  write  SyQ^  e^^y^ 
or  D  =  e*^  as  o<l>Q  =  1.  Thus  by  separating  out  the  bubble  terms  in  the  numerator 
as  a  multiplicative  factor  e***  which  cancels  with  the  denominator,  we  have 
G(x',x)  =  /SJ-  the  propagator  for  the  electron  in  the  presence  of  its  self-field. 
More  generally  we  can  define  a  propagator  connecting  N  points  such  that 
N  =  2n  +  m-^  +  rric,,  2n  points  corresponding  to  the  functions  V'(^i)'V^(^2)---^(^«) 
and  ip(Xi)  .  .  .  ip{Xn),  m^  corresponding  to  99(2/1),  9^  (2/2)  •  •  •  9^  (2/mi)  ^^^  ^2  ^^ 
95(2/1)  •••  9^(2/^2)  inside  the  time  ordered  product  which  also  contains  the 
^'"H^i)  •  •  •  ^^^^i^n)  term.  This  propagator  is  used  to  obtain  matrix  elements 
from  initial  to  final  states  where  every  ip  (x)  occurring  in  the  propagator  corresponds 
to  either  an  incident  electron  or  an  emergent  positron  and  ip{x)  to  an  emergent 
electron  or  an  incident  positron.  In  collision  problems,  usually  the  incident  particles 
are  only  two  in  number  while  the  emergent  particles  can  be  any  number,  the 
restriction  being  only  due  to  conservation  laws.  In  particular  we  can  write  the 
propagator  involving  four  external  lines  (two  fermion  and  two  boson  lines)  as 
in  (51).  In  quantum  electrodjmamics  and  quantum  mesodynamics  the  form  of 
the  interaction  Hamiltonian  of  the  Yukawa  type  is 

H'^'=^eip{x)y^y^(x)A^{x)  (54) 

^i^*  =  gf  {x)y^ritp  {x)(pi  (x) .  (55) 

We  shall  now  explicitly  obtain  the  rules  for  using  the  above  propagator  in  cal- 
culating matrix  elements.  These  can  be  carried  over  without  any  difficulty  for 
propagators  involving  more  than  four  external  lines. 

The  method  described  in  the  previous  chapter  consisted  in  taking  the  expec- 
tation value  of  the  /S-matrix  between  the  initial  and  final  states  which  were 
described  by  creation  and  annihilation  operators  operating  on  the  vacuum.  In 
propagator  formalism,  we  have,  instead  of  the  creation  operator  corresponding  to 
each  member  of  the  initial  system,  field  operators.  Since  these  involve  space-time 
points  their  position  in  the  time  ordering  is  determined  by  the  P  operator ;  which 
means  the  time  point  for  an  interaction  vertex  could  be  earlier  than  that  of  a 
field  operator.  In  the  previous  picture  we  started  with  incident  bare  particles 
and  the  interaction  was  supposed  to  be  switched  on  and  switched  off,  the  duration 
of  the  interaction  being  assumed  to  tend  to  00.  Here  since  the  field  operators 
are  taken  along  with  the  interaction  terms,  we  are  assuming  that  the  interaction  is 
operative  all  the  time  and  therefore  we  are  dealing  with  the  propagator  of  physical 
particles. 

We  should  note  an  interesting  feature  of  the  momentum  transform  of  the 
propagator.  The  annihilation  of  the  initial  electron  at  a  vertex  may  be  followed 
by  the  creation  of  the  same  electron  at  another  vertex  if  the  two  vertices  are 
connected  by  the  emission  and  the  absorption  of  a  photon  (electron  self-energy 
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diagram).  But  the  existence  of  the  matrix  element  implies  that  the  initial  particles 

should  not  occur  in  the  final  state.  Thus  they  will  be  finally  annihilated  at  some 

vertex  after  which  we  do  not  take  the  self-energy  processes  into  consideration. 

Thus  we  should  expect  the  momentum  transform  of  the  propagator  to  be  of  the 

form 

rip',  p,k',h  =  J^i  {k')  S-/{p')  G{p',  p;  k',  k)  8-^^{p)  A-/{k)  (56) 

and  since  we  are  interested  only  in  transitions  from  real  physical  initial  states 
to  real  physical  final  states,  we  write  the  matrix  elements  as: 

u(p')y(p',p;  k',k)u{p). 

We  shall  now  establish  a  relationship  between  the  propagators  in  terms  of 
interaction  and  Heisenberg  operators.  For  this  purpose,  we  recall  that  if  we  take 
the  time  at  which  the  two  representations  coincide  to  be  Iq  ,  then  the  state  vectors 
and  operators  in  the  two  representations  are  related  by 

\0i{t)y=^  U{t,t,)\0jj{t,)y 

and 

Os[t)==U-^{t,t,Oj{t)U(t,t,)  (57) 

The  propagator  in  the  interaction  representation  is 
G{Xi x^-  '  •',  0(^,  ^2  •  •  •) 

oo  oo 

=  H  ^-r-    /  •  •  •   /  do^i . .  .  dXn<T(yjj{x[)  .  .  .  ipjix^J  Hiix,)  •  •  •  ^;(a;,))>o 
«  =  o     "-•      J         J 

-  oo         -oo 

=  <T(C7(+oo,  -oo)  y^j{x'^)  •  .  .  ^HxO))>o 

=  iT(U {+00,  g  U{to,  ri)  y^j{x^)  •  •  •  U{to,  4)  y^jix^)  U{t%,  g  C7(g  -oo)))^ 

=  (Wo,'^{WhK)"-Wh{<))Wo),  (58) 

where  y)Q  represents  the  "true"  or  physical  vacuum  and  where  all  possible  time 
orderings  of  the  operators  have  been  taken  into  account.  The  last  line  represents 
the  propagator  in  the  Heisenberg  representation. 

5.  TRANSFORMATIONS  AND  GENERAL  PRINCIPLES 
OF  INVARIANCE^ 

General  considerations 

So  far,  in  developing  quantum  field  theory,  we  have  assumed  the  equation  of 
motion  to  be  the  generahzation  of  the  Schrodinger  equation  which  states  simply 
that  the  rate  of  change  of  the  state  vector  with  time  is  obtained  by  the  Hamiltonian 


1  This  section  is  based  mostly  on  lectures  given  by  P.  T.  Matthews  at  the  University 
of  Rochester,  1957  on  "The  Relativistic  Quantum  Theory  of  Elementary  Particle  Inter- 
actions." See  also  N.  Kemmer,  J.  C.  Polkinghorne  and  D.  L.  Pursey,  "Invariance  in 
Elementary  Particle  Physics",  Bep.  Progr.  Phys.  XXII,  368  (1959). 
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operating  on  the  state  vector.  The  formal  method  of  obtaining  the  equation  of 
motion  is  to  begin  with  a  Lagrangian  function  and  impose  on  it  the  principle 
of  least  action.  If  we  denote  by  Li(p^{x),(p'i{x))  the  Lagrangian  density  for  any 
field  where  (pcd^)  and  (p^{x)  are  the  field  variables  and  their  first  order  space  or 
time  derivatives,  then  the  equation  of  motion  is  obtained  by  the  variational 
principle  (as  in  classical  mechanics)  thatf^dt  should  be  anextremum  (where  ^ 
is  the  total  Lagrangian)  or 

df^dt  =  0     or     dfLd^x  =  0  .  (1) 

where  L  is  the  Lagrangian  density. 
This  leads  to  the  equation  of  motion 

dL  d       dL 

Since  we  assume  that  L  does  not  contain  derivatives  higher  than  the  first  order, 
the  equation  of  motion  will  be  at  most  of  the  second  order.  We  also  note  that 
there  is  a  certain  arbitrariness  in  the  choice  of  a  Lagrangian,  i.e.  we  can  always 

add  a  four-divergence  of  the  type  -r— ^  dX^ ,  (99^99^)  and  yet  obtain  the  same  equation 

r       dA^ 
of  motion.  The  integral  /  d^ a; -^ — (99^,  99J)  can  be  converted  into  a  3-dimensional 

surface  integral  by  the  Gauss  theorem  so  that  its  variation  vanishes  identically. 

Although  the  two  Lagrangians  L  and  L'  give  rise  to  the  same  equations  of  motion, 

expressions  for  physical  observables  such  as  charge,  current  etc.  will  be  different 

and  hence  the  effect  of  an  interaction  with  an  external  field  will  also  be  different. 

We  can  now  proceed  in  analogy  with  classical  mechanics  and  define  the  Hamil- 

tonian  density  as 

H=Zn.<pl-L,  (3) 

(X 

where  „  y.  „ 

We  can  define  also  a  canonical  energy-momentum  tensor  by 

T,.,S¥i^-Ld„  (4) 

with  T44  and  T^  ^  corresponding  to  the  energy  and  momentum  of  the  field  which 
are  constants  of  motion.  Defining 

M^,=jd^x{x^T,,-x,T^,)  (5) 

the  space  part  of  which  corresponds  to  angular  momentum  we  find  that  if  angular 
momentum  is  to  be  a  constant  of  motion,  the  necessary  condition  is  that  T^,^, 
should  be  symmetric^,  i.e. 

-^  flV  V/l' 


^  F.  J.  Belinfante,  Physica  6,  887  (1939);  see  also  G.  Wentzel,  loc.  cit. 
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The  choice  of  the  Lagrangian  is  based  on  the  important  criterion  that  it  should 
yield  the  desired  equations  of  motion.  Since  in  many  cases  the  correct  equations 
of  motion  are  not  known,  physicists  have  been  guided  in  the  choice  of  the  La- 
grangian by  certain  general  principles  of  in  variance  under  various  transformations. 
These  principles  are  becoming  increasingly  important  especially  in  dealing  with 
the  interactions  of  strange  particles  (hyperons  and  jfiT-mesons)  as  we  do  not  know 
the  equations  of  motion  of  most  of  these  elementary  particles.  We  summarize 
these  principles  in  the  following  sections. 

We  shall  now  proceed  to  describe  in  a  rather  detailed  manner  the  meaning  of 
invariance  with  respect  to  a  transformation.  For  this  purpose,  it  will  be  convenient 
to  discuss  at  first  transformations  of  space-time  points.  If  a  space-time  point  P 
has  coordinates  x  in  one  system  and  if  we  define  the  Lorentz  transformation 
x'  =  {^A^x,  we  imply  that  in  the  transformed  system  the  "same  physical"  point  P 
has  coordinates  x' .  With  every  point  P  let  us  associate  a  physical  quantity  /  (P) 
which  is  a  function  of  P.  In  terms  of  the  variable  x,  let  it  be  defined  as  cp{x). 
If  from  the  physical  nature  of  the  problem  the  value  of  a  physical  quantity  does 
not  depend  upon  the  coordinate  system,  then 

f{P)  =  cp(x)  =  cp(A-^x')  =  cp'(x').  (6) 

We  have  merely  replaced  x  in  terms  of  x'  and  obtained  the  physical  quantitj^ 
as  a  function  of  x' .  This  procedure  implies  the  invariance  of  the  physical  quantity 
under  the  transformation  A.  More  generally,  the  physical  quantity  associated 
with  X  may  not  be  independent  of  the  transformation.  For  example,  if  it  is  re- 
presented by  a  set  of  functions  (poc(x)((X  =  1,  2,  .  .  .)  as  in  the  case  of  a  spinor  or 
a  vector,  then  in  the  transformed  coordinate  system  we  may  have  quite^generally 

^'Ax')  =  2JS,p<pp{x)  =  ZS.pni'^'^^'),  (7) 

where  99  and  99'  are  column  vectors  and  8  is  a.  matrix.  To  obtain  99'  we  have  not 
only  replaced  x  in  terms  of  x^  but  operated  S  on  (p.  The  variation  in  the  value 
of  the  physical  quantity  at  the  same  physical  point  due  to  the  transformation 
is  through  the  operation  of  S.  A  trivial  example  of  99  is  the  position  vector  of  the 
point  P. 

In  equation  (7)  we  have  compared  the  values  of  the  functions  at  the  same 
physical  point  and  hence  the  arguments  x^  and  x  are  not  equal.  If  we  wish  to 
compare  the  values  of  the  functions  for  the  same  parametric  values  we  merely  write 

f'^^^-^s^^lMD  +  i^lj.],  (8) 

where  I  is  used  to  denote  the  parametric  value  of  x  and  the  infinitesimal  e^^  is 
defined  through  the  relation 

K=Ui^MV  +  ^f.v)^v^  (9) 

V 

Thus  we  note  that  there  are  two  contributions  to  the  difference  between  the 
functional  values  corresponding  to  points  in  the  two  coordinate  systems  which 

EPCR      10 
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have  the  same  parametric  value.  One  is  due  to  the  difference  in  the  values  of 
the  coordinates  when  referred  to  same  coordinate  system  and  the  other  due  to  S. 
It  is  convenient  to  call  S  the  matrix  of  intrinsic  variation.  If  we  are  considering 
the  same  physical  point,  there  is  no  need  for  such  separation  because  the  only 
change  is  due  to  intrinsic  variations. 

We  shall  now  generalize  the  concept  of  invariance.  Let  (p{x)  satisfy  certain 
equations  involving  x.  If  99'  (x')  satisfies  the  same  equation  involving  x',  then  the 
equation  for  cp  is  covariant.  The  Dirac  equation  is  covariant  under  Lorentz 
transformation  and  this  is  demonstrated  by  proving  that  it  is  always  possible  to 
find  a  matrix  S  such  that 

^'(o:')  =  Sy){x), 

when  If  and  Y^'  are  the  wave  functions  in  the  two  Lorentz  frames  connected  x'  ^-Lx, 
L  being  the  Lorentz  matrix  and  \p  and  ip'  are  assumed  to  satisfy  the  same  equation. 
S  is  completely  defined  by  the  equation 

V 

It  is  not  necessary  that  the  physical  quantity  should  be  represented  by  a  set 
of  functions  for  S  to  exist.  Even  if  it  is  defined  by  one  function  99  [x),  quite  generally 
q)' (x')  =  S(p{x).  In  physics  we  usually  deal  with  functions  with  S  =  ±1  under 
the  transformation  x^  =  — x,  i.e.  space  reflection.  When  aS^  =  +  1  it  is  called  a 
scalar  function  and  when  S  =  —  1  we  call  it  a  pseudoscalar  function.  Scalar 
functions  are  easily  cited  while  pseudoscalar  functions  can  be  arrived  at  for 
example  as  follows. 

The  scalar  product  of  two  vector  fields  associated  with  space  is  a  scalar  function. 
But  the  scalar  product  of  a  polar  vector  and  an  axial  vector  is  a  pseudoscalar. 
The  most  obvious  example  of  a  pseudoscalar  is  the  scalar  product  of  linear 
momenturn  and  angular  momentum. 

The  relation  between  99'  {x')  and  99  (x)  can  also  be  expressed  by  a  transformation  U , 
i.e. 

q)'{x')^  U-^q){x)U  =  S(p{x).  (10) 

If  we  require  that 

U-^L[(p{x)\U  =  Licpix)-]  (11) 

U  must  commute  with  L\(p{x)'\,  i.e. 

[L,U]^0.  (12) 

If  JJ  can  be  written  as  e~^^*  where  (X  is  a  continuous  parameter,  then  the  change 
in  99  for  an  infinitesimal  change  d(x  oioc  is  given  by 

idcp^[F,xp]doc;          i^-  =  [F,cp]. 

If  the  Lagrangian  is  invariant  under  U,  then  it  should  commute  with  both  F 
and  JJ.  Hence  the  Hamiltonian  should  also  commute  with  F  and  JJ.  Since  the 
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Hamiltonian  is  a  constant  of  motion,  F  is  also  a  constant  of  motion.  The  eigen- 
values of  F  are  observables  and  should  be  real;  hence  F  should  be  hermitian 
and  U  unitary.  Thus  the  invariance  with  respect  to  the  unitary  transformation 
has  generated  and  conserved  quantity  /,  the  eigenvalue  of  F. 
The  transformations  can  be  broadly  divided  into  two  categories, 

(a)  Continuous  transformations,  e.g.  Lorentz  transformation,  gauge  trans- 
formations, etc. 

(b)  Discrete  transformations,  e.g.  when  J.  the  operator  of  transformation  applied 
twice  becomes  an  identity  operator,  i.e.  A^  =  +1.  The  eigenvalues  of  the  operator 
are  ±  1  as  for  example  these  of  space-reflection,  charge  conjugation  and  time  reversal. 
We  will  now  enumerate  some  well  known  transformations  and  the  corresponding 
conserved  quantities. 


Type  of  transformation 

Conserved  quantity 

Remark  about 
invariance 

(a)  Translation  in  four- 

Energy-momentum 

Exact 

dimensional  space 

four-vector. 

(b)  Space  rotation 

Angular  momentum 

Exact 

(c)  Space  inversion 

Parity 

Approximate 

(d)  Time  inversion 

— 

— 

(e)  Charge  conjugation 

Number  of  particles  minus 
number  of  antiparticles 

Approximate 

(f)    Gauge 

Charge  and  current 

Exact 

(g)  Baryon  Gauge 

Heavy  particle  number 

Exact 

(h)  Rotations  in  3-dimensional 

Total  isotopic  spin  /. 

Approximate 

isotopic  spin  space. 

(i)    Rotation  about  third  axis  in 

/3-projection  of  /  along 

Approximate 

isotopic  spin  space 

third  axis 

We  shall  discuss  the  various  types  of  transformations  one  by  one. 

Particular  transformations 

Tra7islatio7i  in  four -dimensional  space 

If  the  Hamiltonian  is  invariant  for  infinitesimal  translation  of  space-time^ 
i.e.  x'  =  x„  +  dx„  where  dx„  is  an  infinitesimal  4-vector,  then 


Since  dx^,  is  arbitrary 


V     dx^ 


dH 

dx„ 


-0     or 


dx^ 


H 


(14) 


(15) 


:e.   the  four-momentum  vector  — —  commutes  with   the   Hamiltonian   and   it 

ox 
a  constant  of  motion.  " 
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1 

Space  rotation 

Consider  now  an  infinitesimal  rotation  ct>  of  the  coordinate  system  about  the 
origin.  Its  components  co^ ,  cog ,  CO3  represent  rotations  about  x,  y  and  z  axes  respect- 
ively, i.e.  1*'  =  1*  +  (O  X  f .  The  corresponding  momentum  vector  will  become 

2>'  =  p  +  o)  X  p.  (16) 

The  change  in  the  Hamiltonian,  dH,  should  vanish. 

dff=(a.xr),.  — +(«,xp)„.  — =  0  (17) 


-|(»-x^)  +  (l>x^))=0.  (18) 

Noting  that  I  =  rxp  where  I  is  the  orbital  angular  momentum  we  see  that 
o)  '  I  =  0,   i.e.   Hs  a  constant  of  motion.  i 

Lorentz  transformation 

We  have  already  referred  to  the  Lorentz  transformation  properties  of  single 
particle  Dirac  equations.  However  the  following  observation  is  necessary  when 
we  replace  the  c-number  wave  functions  by  g-nijmbers,  i.e.  operators.  The  Lagran- 
gian  density  should  be  properly  symmetrized  or  antisymmetrized  to  remove  the 
unobservable  singularities.  This  will  be  illustrated  for  the  Dirac  field.  The  free 
Lagrangian  density  for  the  Dirac  field  is^ 
8 


=  \p^ uip     where     u^  lyAYn "^ i"  ^ ) [ 

=  Z'f^ap9vMydva==  I]'^ap[i{fvVp-n%)  +  HfvWp  +  W^Wv)]{y^)va'       (1^) 

The  second  term  is  a  singular  c-number 

^«^  {%  Wp  +  W^  Wv)  (r4)m  =  ^  (^«^  ^vf)x=Q{y'dva  (20) 

and  is  to  be  discarded  because  it  will  cause  merely  a  constant  (though  infinite) 
shift  of  the  eigenvalue.  Therefore 

=  i[tpy^uy)-y)u'^y^f] 

1  It  is  to  be  noted  that  in  this  section  we  use  the  metrie  (5^^  =  1  for  ^  =  v  =  1,  •  •  •  4  and 
S^y  =  0  for  fi=^v.  The  y^ij  =  I,  •  ■  -  3)  matrices  used  here  are  -  i  times  the  Feynman  y^.g 
while  74  is  the  same. 
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We  shall  now  consider  the  three  important  discrete  transformations-space 
iversion  P,  charge  conjugation  G  and  time  inversion  T.  In  doing  this  for  P 
id  C  we  shall  adopt  the  procedure  of  assuming  the  transformations  for  the  field 
triable  and  obtain  the  transformations  for  annihilation  and  creation  operators 
iirectly  therefrom.  For  the  T-operation  the  procedure  is  a  little  more  complicated, 
fince  it  is  not  sufficient  to  define  the  operation  for  a  single  operator  but  it  is  neces- 
iry  to  define  it  with  respect  to  a  sequence  of  operators  from  physical  arguments. 

inversion 

Complex  scalar  and  pseudoscalar  fields :  Let  P  represent  the  operation  of  space 
jflection, 

i.e.     P:x^x'^—x     and     P:  x^^  x'^^^  x^. 

le  complex  field  variable  cp  (x)  transforms  as 

Pep  (x)  P-i  ^  cp'  {X')  =  np(p  {X)  =  ri^fp  i-x',  X,) ,  (22) 

rhere  rjp  can  be  complex.  If  we  assume  that  reflection  performed  twice  is  equi- 
ralent  to  an  identity  transformation, 

rjl  =--  +  1 ,     i.e.     r\p=  ±  1 . 

ty^=  +  1  \i  (p{x)  is  a  scalar  field  and  7]^=  —\  if  (p{x)  is  a  pseudoscalar  field. 
7]^  is  called  the  parity  quantum  number.  From  the  transformation  properties 
of  (p  [x)  we  can  deduce  those  of  the  annihilation  and  creation  operators  in  the 
expansion  of  cp(x).  Omitting  normalization  factors,  the  Fourier  decomposition 
of  (p  {x)  is 

(p{x,  x^  =  JJ  [a{k)  e^C^-*-**^^)  +  b\{k)  e-^C^-^-^**^*)].  (23) 

k 

Then  we  have 

P (p{x)  P-^  =  2;[Pa{k)  P-^e-i(^'^">  -^  PbHk)  P-^e-^^-^'] 


k 

=  r]p(p{-x',  x^)  =  V[?y^a(fe)e*<-''-*'-^-*^«)  +  r/^,6t(fe)  e" *("*•*' -^•«^*>] 


m 

■  therefore  P  a{k)  p-^  =  fjpa{-k) .  (25) 

I  PbHk)P-^  =  7]pbH-^)-  (26) 

I 


Similarly  we  have 

PaHk)P'^  =  f]paH-k);     Pb{k)  P'^  =- r]pb{-k).  (27) 

^o  understand  the  meaning  of  these  equations  consider  the  one  particle  state 


«^(fe)>o-  Making  the  natural  assumption  that  the  vacuum  state  has  even  parity, 
i.e.  P>o  =  >o  we  get 

PaHk)>o  =  PaHk)  P-iP>o  -  nv<^H-k)y,.  (28) 
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The  parity  operation  changes  the  one  particle  state  with  momentum  k  to  a  one 
particle  state  with  momentum  — fe.  If  rjp  =  +1  the  particle  has  even  intrinsic 
parity  and  if  r]p  =  —  1  it  has  odd  intrinsic  parity.  If  we  expand  cp  (x)  not  in  terms 
of  free  particle  wave  functions  as  above  but  in  terms  of  the  eigenf unctions 
of  the  angular  momentum  operator  we  have 

q^{x)=\2](in'f^n{x)  +  K<{x)],  (29) 


where 


^n{x)  =/-p^l/7^/^V.(^^)  YlmiO,  (P)  d^k. 


(30) 


Z  =  0,  1,  2  .  .  .  denote  S,P,D  .  .  .  states. 

Proceeding  as  before  we  can  show  that  a  particle  having  an  even  intrinsic 
parity  (i.e.  a  scalar  particle)  existing  in  a  state  of  angular  momentum  I,  m  has  a 
parity  ( — 1)^  (i.e.  the  parity  of  Yi^),  but  for  a  pseudoscalar  particle  (i.e.  odd 
intrinsic  parity)  the  parity  is  (—1)^^^. 

Electromagnetic  field:  The  electromagnetic  field  is  a  four  vector  Af^{A,  A^)  and 
it  transforms  under  space  reflection  P  as 

PAk(x,  x^)  P-i  =  -Aj,{—x,  x^)         k=  1,2,3 

PA^P-^  =A,(-x,x,).  (31) 

These  properties  follow  from  the  fact  that  the  magnetic  field  H  is  an  axial  vector 
and  the  electric  field  ^  is  a  polar  vector.  Since  for  a  pure  radiation  field  ^4=0 
the  photon  has  intrinsic  parity  —  1 .  ^ 

Spinor  field :  Applying  space  inversion  we  have 

Pyj  (x)  P-i  =  y^p  {X')  =  Sprp  (x) .  (32) 

Sp  is  so  chosen  that  the  free  Lagrangian  density  for  a  spinor  field  is  left  invariant. 
The  choice /S^^  =  77^,74  where  77^,  is  a  complex  number  of  modulus  unity  is  sufficient. 
Reflecting  a  coordinate  system  twice  is  equivalent  to  a  rotation  through  an  angle 
271.  Under  this  rotation  a  vector  gets  reflected  twice  whereas  a  spinor  only  once. 
So  a  rotation  through  4:7t  is  necessary  so  that  the  spinor  is  restored  to  its  original 
state.  Hence 

r)p^  ±1,     i.e.     r]p=  ±1,  ±i. 

Yang  and  Tiomno^  have  classified  the  four  kinds  of  spinor  fields  as  ^,  B,  C 
and  D  according  as  r]p  =  +i,  — *,  +  1  and  — 1  respectively. 

As  in  the  case  of  the  boson  field,  we  can  prove,  making  use  of  the  expansion 
for  y){x)  and  y){x)  in  terms  of  annihilation  and  creation  operators  and  noting 
yiUiik)  =  u^i—k),  y^Viik)  =  —Vi{—k)  that 

Ph,{k)P-^^r^ph,{-k)  (33) 
Pdl{k)P-^=^-7]pdl{-k).                                      (34) 

^  This  result  is  consistent  with  the  classical  theory  of  the  electromagnetic  field. 
2  C.  N.  Yang  and  J.  Tiomno,  Phys.  Rev.  79,  495  (1950) .  F.  Gursey  has  adopted  this  result 
to  establish  a  connection  between  strangeness  and  parity.  See  Phys.  Rev.  Lett.  1.  98  (1958). 
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On  applying  the  P  operator  to  the  one  particle  state  we  find  that 

PK{k)\  =  %bl{-k)yo  (35) 

P dl  {k)y,  =  -  r^^,  dl  (- k)\.  (36) 

For  the  case  r]p=  ±1,  the  anti-fermion  has  a  parity  and  momentum  opposite 
to  that  of  a  fermion. 

Having  determined  the  transformation  properties  of  xp  and  %p  it  is  easy  to  study 
the  transformation  properties  of  bilinear  covariants  yjQiip  where  Q^  's  consist 
only  of  the  y-ma trices.  Under  space  reflection  P  we  have 


PifQiip)  P-'  =  PipP-^iPy^P-^  =  ipyAy^W' 

In  the  table  below  are  listed  the  properties  of  the  covariants. 


(37: 


^i 

y.ihy. 

Remark 

I 

I 

Scalar 
Vector 

Y  iy^i  Vv  -  Yv  7^) 

~^{y^.yv-yvy(x)'^  either^  or  v  =  4 

Tensor 

iys 

+  *v^y5;  /i  =  i,  2,  3 

Pseudovector 
Pseudoscalar 

It  should  be  noted  that  these  transformations  will  also  depend  upon  the  types 
of  spinors  xp  and  xp.  Demanding  that  the  interaction  of  a  spinor  field  with  a  boson 
field  should  be  a  scalar,  we  can  write  down  the  proper  interactions  immediately 
from  the  transformation  properties  of  the  boson  field  using  the  table.  For  example 
consider  a  pseudoscalar  field  (p  interacting  with  the  Dirac  field  xp  with  ly^  =  +  1 . 
Then  the  interaction  will  have  the  form  igxpy^xpcp.  This  is  the  famous  Yukawa 
interaction. 

Charge  conjugation 

We  can  define  an  operator  U^  which  transforms  the  field  cp  into  its  hermitian 


conjugate  as 


Uc(p{x)  U-^  =  r],(p^x) 


(38) 


This  as  we  shall  see  is  equivalent  to  changing  a  particle  to  its  anti-particle.  If 
the  particle  is  charged  its  anti-particle  has  the  opposite  charge.  However  even  for 
neutral  particles,  anti- particles  exist  and  the  operation  17^  which  switches  the  particle 
to  its  anti-particle  and  vice  versa  should  strictly  be  called  particle-anti- particle 
conjugation  rather  than  charge  conjugation. 
We  shall  now  consider  the  effect  of  this  operation  on  various  fields. 
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Complex  fields :  Consider  a  non-hermitian  spinless  field  (scalar  or  pseudoscalar)  (p 
which  is  expanded  as 

q){x)  =  2] {a{lc)  eif^' '^  +  b^k)  e-i^--'} .  (39) 

k 

Performing  the  charge  conjugation  operation  on  cp  we  have 

k 

rj^ (p^x)  =  r]c2J[(^HJ^)  e"'*""^  +  b{k)  e'^"^]  (41) 

k 

Since-  U,(p{x)U;^  =  r],(p^x)  (42) 

we  get  U,  a  {k)  U- 1  =  r],  b  {k)  (43) 

and  U,  6t  (fe)  U-^  =  rj,  at  (fe) .  (44) 

For  the  complex  field  cp  ^  cp^  and  therefore  the  transformed  field  variable  is 
different  from  the  original  one.  Thus  a  charged  particle  state  will  not  be  an  eigen 
state  of  Uc . 

Neutral  fields :  For  a  neutral  field  (scalar  or  pseudoscalar)  (p  =  cp^  and  hence  b]. 
and  b\  are  identical  with  aj^  and  a\  respectively.  A  neutral  particle  state  will  be 
an  eigenstate  of  11^  and  hence  7]^  =  r]*  and  is  real.  Since  ?7c  =  1,  r}c=  ±1,  i-e. 
we  have  two  eigenstates  of  Z7g  corresponding  to  t/^  =  +1  and  77^==  —  1  respect- 
ively. We  shall  make  use  of  the  possibility  of  the  two  eigenstates  of  Uc  while 
disscusing  K^  and  K^. 

Spinor  field :  As  before  we  write 

Ucy^{x)U;^=StpUx)  =  rpc,  (45) 

where  8  =  r]cC*  with  the  C  matrix  satisfying  the  properties 

CyC-^  -  y*,  C*  =  C-i  and  C'^  =  -C,  (46) 

Further  we  have 

u*{p)  =  Cv{p);     u{p)  =  C*v*{p).  (47) 

Thus 

U,y){x)  C7-1  =  y  2J[bl  ip)  c*  u^(p)  e"^^-^  +  d,{p)  c*  v,{p)  e^^'^.  (48) 

p      r 

But 

%{^)  =  ZUmP)  MP)  e'^-^  +  df{p)  Vrip)  e-^>-^  (49) 

p      r 

where 

¥,=  U,b,U-^     etc. 
Hence  we  require 

UMp)Ul^  =  r„dl(p)  (50) 

U,dAp)U;^  =  ti,bHp).  (51) 


^  Actually  here  v't  should  be  written  as  a  column,  i.e.  the  ^t  is  here  actually  (v't)^. 
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Electromagnetic  field:  The  transformation  properties  of  the  photon  field  A^ 
under  1]^  can  be  obtained  by  demanding  that  the  interaction  Hamiltonian  of 
the  electron-photon  fields  should  be  invariant  under  this  transformation.  This 
implies  that 

U,A„U-^  =  -A,,.  (52) 

The  transformation  properties  of  ^  and  ip  determine  the  corresponding  trans- 
formation properties  \xp,  Q  ip]  which  are  listed  below : 


i 

Q,                .//> 

s 

V 

1 

i  yf. 

1 
-1 

T 

Y^y/^yv-yvyf,) 

-1 

A 
P 

iy^^y-o 

iy^ 

^1 

+  1 

The  transformation  properties  of  bilinear  covariants  can  be  extended  to  more 
than  one  such  covariant.  A  typical  example  is  that  of  the  /5-decay  interaction 
given  by 

Hp  =  Z^dWa^iWh){Wc^iV>d)-^'h'C.  (53) 

where  a,  b,  c  and  d  represent  the  four  fermion  fields.  In  variance  of  H^  under 
charge  conjugation  requires  that  gi  's  should  be  real. 

Let  us  consider  the  Yukawa  interaction  of  a  boson  with  a  fermion  field. 
If  we  require  it  to  be  invariant  under  charge  conjugation,  we  can  identify  the 
transformation  factor  rjc  (i.e.  charge  conjugation  parity)  for  the  boson  fields 
defined  by 

C/,9,[7;>  =  4*)y.  (54) 

For  convenience  of  reference  we  tabulate  them  below : 


Boson  field 


^U) 


Neutral  scalar  or  pseudoscalar 
Photon 
Vector  meson 
Axial  vector  meson 


S,P 
V 
V 
A 


In  order  to  realize  the  use  of  this  transformation  we  cite  some  applications, 
(a)  If  the  interaction  is  left  invariant  under  Uc  so  also  will  be  the  /S-matrix,  i.e. 

<f\8\iy  =  <f\u;^su,\iy  =  <f\s\i<^y.  (55) 

which  means  for  example  the  scattering  of  tz'^  from  protons  is  identical  with  Ji" 
from  antiprotons. 
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(b)  We  will  prove  a  very  important  result  known  as  Furry 's  theorem^  in  quan- 
tum electrodynamics.  Consider  a  process  having  an  odd  number  of  external 
photon  lines.  Then  the  propagator  in  the  momentum  representation  is 

<T(A^,{x,)A,{x.^^-.A,{xn))y.     (56) 
The  operators  A„  are  in  Heisenberg  representation. 
Applying  U(.  we  see 

G'ip^,  P2,  •••.  2^n)  =  /e^"^^^"^"^"-"^---"^""«\^(^JK)...,  A<i{xn))y 

=  {-irG{p„  ...,pn).  (57) 

Since  G^  =  —  G,  G  =  0  if  n  is  odd.  Thus  no  real  process  can  occur  with  an  odd 
number  of  external  photons. 

Time  reflection 

We  would  expect  that  the  time  reflection  T  in  analogy  with  that  of  space 
reflection  should  imply  t-^t'  ^  —t.  We  shall  see  that  this  definition  is  inadequate 
and  requires  an  additional  prescription  which  is  forced  upon  us  by  both  physical 
and  mathematical  considerations.  We  are  led  to  two  types  of  time  reversals  strong 
and  weak,  Tg  and  T^  respectively. 

Assuming  that  the  T  operation  corresponds  to  going  over  from  tto  —  t,  we  proceed 
as  in  the  case  of  space  reflection  and  consider  a  scalar  field  variable  which  becomes 
under  T: 

T(p{x)  T-i  -  cp'(x,  n  =  (p{x,  t)  =  g){+x,  t').  (58) 

Expanding  (p{x,  t)  in  terms  of  creation  and  annihilation  operators  we  have 

q){x)  =yj[a{k)  e'^-'  +  h^k)  e-'^""^]  (59) 

k 

T (p{x)  T-^  =  ^[T a{k)  T-^ e-^<<^-^^^'^^^  ^  2]  T W {k)  T-'^ Q^^^-^^^^i^.         (60) 

k  k 

Thus  we  have 

Ta{k)T-'^  =  W[-k)     or     T  a^k)T-'^  =  h{-k)  (61) 

TW{k)T-^  =  a{-k)     or     Th{+k)T-^=^a^-k),  (62) 

i.e.  creation  operators  of  particles  of  momentum  k  become  the  annihilation 
operator  of  antiparticles  of  momentum  —  k.  This  time  reflection  with  the  inclu- 
sion of  a  phase  factor  r)  is  known  as  the  strong  time  reflection  Tg^  under  which  the 
particle  and  anti-particle  are  interchanged.  However  we  can  define  another  time 
reflection  called  the  weak  time  re^edio?i  T^  known  also  as  Wigner's^  time  reversal. 
Under  T^,  we  have 
Tu(p (^)  T-J=(p^ (x,  f)  ^ricp^ (x,  t) .  (63) 

1  W.  H.  FuRiiY,  Phys.  Rev.  51,  125  (1937).  This  theorem  can  equally  well  be  applied  to  n^ 
decaying  into  two  photons,  see  C.  N.  Yang,  Phys.  Rev.  77,  242  (1956). 

2  J.  ScHWiNGER,  Phys.  Rev.  82,  914  (1951). 

3  E.  P.  WiGNER,  Gottinger  Nachr.,  546  (1932);  Proc.  Nat.  Acad.  Sci.  38,  449  (1952). 
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Proceeding  as  in  the  above,  we  arrive  at  the  following  relations  for  creation  and 
annihilation  operators 

T„«'{fc)7'-i  =  >?a(-fc);         T^bHk)T-^==rjb(-k).  (64) 

The  striking  feature  of  T^,  is  that  the  particle  and  anti-particle  nature  is  not 
disturbed.  The  two  time  reflections  are  related  by 

T,,=  CT,.  (65) 

Since  in  both  the  cases,  the  creation  operators  become  annihilation  operators, 
it  is  physically  reasonable  to  expect  that  under  time  reversal  we  should  also 
change  the  order  of  events,  i.e.  the  initial  state  becomes  the  final  state  and  vice 
versa.  But  this  description  implies  that  in  the  case  of  operators,  it  is  not  just 
enough  to  reverse  the  time,  i.e. 

Toc{t)T-^^(x'{t')  (66) 

but  we  have  also  to  impose  an  additional  rule  in  the  case  of  a  product  of  operators 
{x{t)  Pit).  Precisely  we  have 

Toe [t)  Pit)  T-i  -  P'it')oc'it'),  (67) 

i.e.  we  reverse  the  order  of  the  product  of  operators.  The  mathematical 
consequence  of  this  precsription  is  the  preservation  of  commutation  which  is 
otherwise  not  possible. 

Let  us  apply  T  to  qit)  and  pH),  i.e.  coordinate  and  momentum  operators 

Tqit)  T-^  =^  q'it')  =  qit);  Tpit)  T-^  ^  fit)  =  -pit).  (68) 

The  commutation  relation  between  q'it^),  p'  it')  is 

[q'(l'),  p'(f )]  =  [q{t),  -p(()]  =  -Ml),  vm 

+  [q(t),vm-  (69) 

Hence  it  is  not  invariant  under  the  transformation  T  and  hence  T  is  not  unitary. 
However  accepting  the  prescription  of  the  reversal  of  order  we  find 

T{q(t),  pmT-'  =  [p'in,  q'it')]  =  -lp(t),  q(t)]  (70) 

=  [q{t),p{t)]- 

Under  time  reversal  the  state  vectors  transform  as  follows : 

^|V^>==  <V^'|.  (71) 

<y;|Tt=<^|2^-i=  |^'>.  (72) 

We  shall  now  show  that  T^  is  equivalent  to  complex  conjugation  followed  by 
a  unitary  transformation.  Consider  the  time  dependent  Schrodinger  equation 


dt 
If  we  now  change  ^  to  — t,  we  find 


Hxpit).  (73) 


^•^^'(0=^>'(0-  (74) 
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where  ^'(^')  =  tpi — 0  ^^^  ^'  ^^  obtained  from  H  by  replacing  p  by  — p  and 
reversing  the  order  of  events.  This  reversal  is  of  no  consequence  if  jH^  is  a  sym- 
metric function  of  p  and  q.  Taking  the  complex  conjugate  of  the  above  equation, 
we  find  that  \p*  satisfies  the  same  equation  as  ip  provided 

H'*  =THT-^  =  H.  (75) 

Thus  we  find  that 

^>'{x,n^Ky)''{x,t),  (76) 

i.e.  the  time  reversed  wave  function  is  obtained  by  a  unitary  transformation 

and  a  complex  conjugation  (i).  Thus  the  operator  for  time  reversal  can  be  expressed 

as 

T^=Kx    (complex  conjugation).  (77) 

From  the  transformation  properties  of  q  and  p  it  follows  that  the  angular  momen- 
tum operators  J^  and  J^  should  transform  as 

T„J,T„^  =  -J,  (78) 

or  [Jz,  Ty^^  =  0,  i.e.  J^  anti-commutes  with  T^,.  But 

rp      J2rp-\  __   rp      j  rp-  \  rp      j  rp-  1  _     72 

or  N 

lJ\Ty]  =  0.  (79) 

Hence  the  eigenvalue  of  J^^  i,e,  j{j  +  1)  will  be  a  constant  of  motion  while  that 
of  Jz  will  not  be  a  "good"  quantum  number.^  These  considerations  apply  equally 
well  to  spin  operator  (j. 

Consider  a  state  with  total  angular  momentum  j  and  ^-component  m.  We  have 

J,T^  \jmy  =  -T^J,  \jmy  =  -mT^  |;m>,  (80) 

i.e.  the  state  T^\jmy  is  an  eigenstate  of  J^  with  eigenvalue  — m.  How- 
ever T^\jmy  is  still  an  eigenstate  of  J^  with  eigenvalue  j{j  +  1).  Therefore 
^w\j'^y  =  f^jm^h  — ^  I  where  c^^  depends  on  j  and  m  with  |cy^  p  =  1  (ie)  c^-^  =  ±1. 
We  will  deduce  a  theorem  analogous  to  the  principle  of  detailed  balance  widely 
used  in  statistical  mechanics.^  Let  us  assume  that  the  >S-matrix  is  left  invariant 
under  T^  i.e. 

S  =  T-J^ST^  =  <Mr^,  k^r^,  ...\8\k[,  r^,  fe^r^,  .  .  .> 
=  ±<-k[-r[,  -K-r'„  ...\S\-k,-r„  -k,-r„  .  .  .>  (81) 

where  we  have  described  the  initial  and  final  system  by  the  momenta  k  and 
spin  r  of  the  various  particles.  However  this  result  is  not  identical  to  the  theorem 
of  detailed  balance  which  demands  that 

|<fe,ri,  fe2^,,...|^|fe^r;,  fe;r;,...>p==|<fe;r^,A:;r^,...|^|fciyi,-fe,r2,  ...>F.   (81a) 

^  From  this  result  it  can  be  shown  that  there  cannot  be  an  observable  which  connects 
states  of  integral  and  half  integral  spin.  See  G.  C.  Wick,  A.  S.  Wightman  and  E.  P.  Wignek, 
Phys.  Rev.  88,  101  (1952);  Kemmer  et.  al.,  Rep.  Progr.  Phys.  23,  400  (1959). 

2  J.  HAMn.TON  and  H.  W.  Peng,  Proc.  Roy.  Ir.  Acad.  49,  197  (1944);  F.  Coester,  Phys. 
Rev.  84,  1259  (1951). 
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But  since  we  know  that  k  becomes  — k  under  P,  if  we  also  assume  that 
PSP-^  =  8,  then  (81)  becomes 

<fe,ri,  fegra,  ...|/Sfjfeir(,  fe^r^,  .  .  .> 

=  ±<K~r[;  k'^-r'^,  .  .  .\8\k^- r^,  k^-r^,  .  .  •>     (82) 

where  we  have  space-reflected  the  states.  This  still  differs  from  (81a)  in  that 
the  spin  states  are  different.  We  can  remove  the  dependence  on  spin  states  by 
summing  over  all  possible  spin  states.  Thus 

i;|<fei,  k^,  ...\8\k[,  ki  •.^yf=  S\<K  K^  -'-\^\K  fea-.-M'.    (83) 

Spin  Spin 

Imposing  rotational  invariance  on  8,  the  matrix  element  cannot  depend  upon 
the  m  value.  Then  we  can  obtain  a  detailed  balancing  theorem  for  transitions 
between  states  of  the  same  total  angular  momentum  quantum  number  j : 

iPjm  \8\ocjmy  =  iocj  —  m  \8\^j-my  (84) 

where  oc  and  ^  are  some  other  quantum  numbers  necessary  to  identify  the  "chan- 
nels". This  result  is  very  useful  in  the  study  of  nuclear  reactions. 

Another  interesting  application  of  the  principle  of  invariance  under  time  reversal 
is  the  following.  Defining  a  reaction  matrix  K  through 

the  unitarity  of  >S^  implies  the  hermiticity  of  K.  If  8  is  invariant  under  T^  and 
rotation,  K  is  real.  The  importance  of  the  K  matrix  is  that  it  describes  the  scatter- 
ing process  in  terms  of  a  minimum  number  of  real  parameters.  Using  (85)  we  can 
show  that  , 

t  =  - ^^,  (86) 

where  t  and  k  are  sub-matrices  of  T  and  K  respectively  containing  the  matrix 
elements  which  are  diagonal  in  E,  J,  J^  but  differ  with  respect  to  other  quantum 
numbers  necessary  to  denote  the  various  final  channels  if  available. 

In  the  case  of  single  channel  if  we  define  K  in  terms  of  a  single  real  parameter 
which  we  can  take  to  be  the  phase  shift  d,  i.e. 

k  =  —  t3ind  (87) 

we  have  , 

t  =  —  ei^  sin  6.  (88) 

71 

If  we  consider  the  following  two  channel  reaction 

71^^  -\-  n  :^  TC^  +  n     (elastic  scattering) 

^y  +  P       (radiative  capture  or 

photo  production  of  mesons) 
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we  can  characterize  the  K  matrix  with  the  real  parameters  a{y'p-^yp)^ 
h{n^  +  n:^y  +  p)  and  tan(3(7r+  4-  n-^jf^  +  n).  Since  there  is  only  little  elastic 
yp  scattering,  we  can  set  a  =  0.  We  can  show  that  t  is  given  by 

1   r        0         fte'^^cos^" 

~^   be'^cosd    e^'^sinc^ 


(89) 


The  importance  of  this  result  lies  in  that  the  photo  production  amplitude  involves 
the  same  phase  shift  as  7r+  +  w  scattering.^  The  K  matrix  is  particularly  useful 
when  there  are  many  final  channels  both  elastic  and  inelastic  as  for  example 
in  K-  —  p  or  K-  —  n  interaction s.^ 

Strong  reflection  R,  or  CPT  {T==  TJ 

We  will  now  study  the  effect  of  strong  reflection,  i.e.  reflection  of  both  space 
and  time  coordinates.  This  is  defined  as 

R,x,,  =  x;,  =  -x^.  (90) 

The  effect  of  Rg  on  a  scalar  field  99  {x)  is  given  by 

Rg(p{x)  R;^  -  99' (^')  =  rjsfix).  (91) 

By  familiar  arguments  we  now  obtain  by  comparing  the  coefficients  of  e'^^'^ 
suitably  in  rjs(p{x)  and  q)'  {x') 

R,a{k)R;^  =  rjgbHk) 

R,hHh)R-^  =  ri,a(k).  (92) 

Thus  an  initial  state  of  a  single  particle  with  momentum  h  is  transformed  into 
a  final  state  of  an  anti- particle  of  momentum  h. 

In  obtaining  the  transformation  for  operators,  we  notice  that  we  have  combined 
P  with  strong  time  reflection  as  defined  before. 

The  spinor  field  transforms  as 

Rs V (^)  ^J ^  =  'ip'{x')  =  i7]sy^ip {x) 

Rsip{x)R-^^iritylxp[x).  (93) 

The  corresponding  rules  for  the  annihilation  and  creation  operators  are 

R,a,{k)R;^^{-r)ri,hU{k) 
RsK{k)R;^=={-r)ri,al,{k).  (94) 

Therefore  an  initial  state  of  single  particle  with  momentum  k  and  spin  r  is  trans- 
formed to  a  final  state  of  an  antiparticle  state  with  momentum  k  and  spin  — r,  i.e. 

. Rs  \k,ry  =  {-lYrj,<f„  -r\  (95) 

1  M.  Gell-Mann  and  K.M.Watson,  Ann.  Rev.NucL  Sci.  4,  219  (1954);  G.  Takeda,  Phys. 
Rev.  101,  1547  (1956). 

2  M.  H.  Ross  and  G.  L.  Shaw,  Phys.  Rev.  115,  1773  (1959).  For  an  interesting  alternative 
formulation  for  describing  inelastic  scattering,  see  P.  T.  Matthews  and  A.  Salam,  Nuovo 
Cim.  13,  381  (1959). 
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It  is  interesting  to  see  the  effect  of  CPT  on  a  state 

T\k,ry^  <-fe,-r| 
P<—h,—r\=  <fe,  — r| 
C<M,-r\=  ik,-r\.  (96) 

Thus  CPT  \k,  ry  =  <fe,  -r  |  =  R,  \k,  r>  (97) 

and  Eg  is  equivalent  to  CPT. 

If  we  now  define  Sch winger's  time  reflection  ^  as 

T,  =  R,P, 

it  is  equal  to  CT,  i.e.  to  performing  charge  conjugation  on  the  time  reversed 
state.  This  is  equivalent  to  the  strong  time  reversal  previously  defined.  Tg  switches 
an  initial  particle  state  with  momentum  k  spin  r  to  a  final  anti-particle  of  momen- 
tum — k  spin  — r. 

Following  Matthews  and  Feldman^,  we  will  prove  an  important  result  called 
the  CPT  theorem  or  the  Pauli-Liiders  theorem.^  The  theorem  states  that  any 
physical  theory  invariant  under  proper  orthochronous  Lorentz  transformation 
with  the  usual  relation  between  spin  and  statistics  is  automatically  invariant 
under  strong  reflection  or  CPT^  We  wish  to  stress  that  while  a  proper  Lorentz 
transformation  is  a  continuous  transformation,  CPT  is  a  discrete  transformation 
and  hence  any  result  connecting  these  different  transformations  is  very  remarkable. 

The  propagator  of  a  particle  with  incoming  and  outgoing  momenta  q  and  p 
is  given  by 

(^.^{P,  q,  h)  =  fe-'^P'-'"--\T{tp,{x)ip^{y)^-^)yd'xd'y,  (98) 

where  oc,  ^  specify  the  states  in  more  detail  and  k^  denote  collectively  the  incoming 
and  outgoing  momenta  of  other  kinds  of  particles.  Performing  R^  on  this  gives 

l^^''''-''--Hiy,U<T{ipM%{^))>{iy3),pd^^d^y  (99) 

where  we  have  changed  the  integration  variables  x^—x  and  y^—y.  Using 
the  anti-commutability  (or  Fermi  statistics)  of  ^  field  we  can  write  yj  {y)  ffai^) 
"  — V^a(^)  y^^iy)-  Notice  however  that  this  sign  will  cancel  with  ( — i)^.  Thus 

^«/3  {Pi,qi ,  ki)  =  yA^{—p^,—q^,  —ki)  y^.  (100) 

In  order  that  G^  be  covariant,  it  should  involve  p,  g,  k^  in  the  form  p,  q,  k^  etc. 
Hence  we  have 

r5^./3(-p,  -q,  -k,)  =  G^.^(p,q,k,)  (101) 
since 
75P75  =  -P 


^    J.  SCHWINGER,  loc.  cit. 

2  G.  Feldman  and  P.  T.  Matthews,  Phys.  Rev.  102,  421  (1956). 


3  G.  LuDERS,  Kgl.  Danske  Vidensk.  28,  5  (1954);  Ann.  Phys.  2,  1  (1957);  W.  Pauli,  Niels 
Bohr  and  the  Development  of  Physics,  Pergamon  (1950). 

^  Invariance  under  CPT  implies  that  there  exists  a  state  for  the  anti-particles  correspond- 
ing to  each  possible  state  of  the  particle  with  the  difference  that  the  space  and  time  are  inverted. 
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Thus  the  propagator  is  left  invariant  under  Rg  or  CPT  and  the  theorem  is  proved. 
For  convenience  we  will  collect  all  the  different  transformation  properties  of 
the  scalar  and  spinor  fields  and  exhibit  in  a  table. 


\^Operation 

P 

Ue 

^^^v^ 

x'  =  -x 

particle  -^ 

Rs 

Ru, 

Tu, 

Field^\ 

^  =  0:4 

anti-particle 

x'  =  -x 

Rw^RsUc 

T^=Ru,'P 

<Pv  =  npV{-x) 

Ve^ncV^ 

<Ps  =  11s'P{-X) 

<Pu>  =  'nw<pH-x) 

<PT  =  rij<(p^{-Xi) 

9 

9p'^  =  Vp*  9^  {- X) 

9e^  =  nc*<P 

fs^^Vs'vH-x) 

<Pu,^^nw*fp{-x) 

<PT^  =  nT*9(.-X^) 

scalar 

V  =  i 

hcP  =  i 

^s  =  -ns^{-x) 

or 

for  charged 

pseudoscalar 

»?p  =  1   for  s 

V  =  l 

^s^  =  -  %*  ^t  (-  X) 

Vw='VcVs 

f]T  =  VeVsVp* 

=  -  1  for  p.s. 

for  neutral 

%^=1 

Wp^npy^wi-x) 

Vc  =  »7cc'vt 

Ws  =  insV6y>{-x) 

Vw  =  ir]wV5C*W^(-x) 

WT  =  inTViC''v{-Xi) 

V  =  Vvt(-ac)y4 

We^'-Ve'VC 

V«t  =  -iJ7,>t(-a;)yg 

Ww'^  =  -inw*w{-x)eY6 

WT^  =  -inT*W{-Xi)yJcy^ 

v 

Vp  =  r}p*w{-x)Vi 

We^nc'^CVi 

v«  =  iVv(-«)y5 

Vw  =  '^Vw*V(-x)cyiyi 

VT  =  iVT*v{-Xi)Yjl'cy^y, 

spinor 

np-i 

for  A  type 

h«P  =  l 

ns'  =  ±'^ 

Vw  =  VeVs 

nT-rieVsnp* 

Continuous  transformations  i 

The  continuous  transformations  we  have  till  now  studied  are  those  dealing 
with  space-time  coordinates.  More  generally  we  can  define  a  continuous  trans- 
formation with  respect  to  a  single  parameter  or  a  set  of  parameters,  the  space- 
time  transformations  being  particular  cases  of  these.  In  discussing  this,  we  shall 
first  take  up  the  case  of  a  single  parameter  other  than  space-time.  We  shall  denote 
the  field  operator  99  (x)  as  99  {x,  X)  and  ask  how  (p  {x,  X)  transforms  when  the  para- 
meter X  is  changed  to  A  +  dA  without  x  being  affected.  We  can  write  | 

(p(x,X  +  dX)  =  (1  +  TdiX)(p(x,  X)  (102) 

where  T  is  a  matrix  of  transformation  when  99  (iP,  X)  has  more  than  one  component 
but  is  a  complex  number  for  a  scalar  (p{x,  X).  Suppressing  the  parameter  X,  we 

write 

(p'{x)=  (1  +  TdX)(p{x)  (103) 

or  in  terms  of  the  components  of  99 

<pM=r.(d^^  +  AXT,^),pi,(x).  (104) 

If  we  assume  that  (p'o,{x)  =  u~^  (p^(x)  u,  a  relation  between  u  and  T  can  be 
established  as  follows. 

Setting    u=^  l—iFdX, 

where  F  is  also  defined  equally,  if  u  is  unitary,  F  will  be  Hermitian. 

cp',  =  {l  +  iFdX)(p,{l-iFdX) 


(p^^i[F,  99jd;i. 

T<p  =  i[F,cp-\. 


(106) 
(107) 
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The  solution  for  F  can  be  recognized  to  be 

F  =  iJTi^ix)  T,p<pp(x)  d^x.  (108) 

For  finite  variation  of  A,  we  can  write 

cp'{x)  =  e'^^(p{x).  (109) 

If  the  Lagrangian  density  L  is  invariant  under  such  a  transformation,  we  see 
that  F  will  be  a  constant  of  motion.  Defining, 


this  leads  to  the  equation  of  continuity 


v^^  =  o  (in; 


for  /^ ,  the  vectorial  operator  corresponding  to  F  which  is  consequently  conserved. 
We  shall  now  consider  a  few  applications  of  the  above  result. 

The  complex  scalar  field 

Consider  a  non-Hermitian  field  ^  =  ^^  +  ((p^  where  99^  and  992  ^^^  ^^^^  fields. 
Putting  T^p=  ied^p  the  conserved  operator  F  is 

F  =  i8J[n,[x)cpAx)-nl{x)cpl{x)\.  (112) 

Utilizing  the  expression  for  q)a(x)  and  7r^(a:),  we  get 

F  ==  e{n^  —  n.)  (113) 

where  n+  and  n^  are  total  numbers  of  particles  and  anti-particles  in  the  field.  Choos- 
ing the  constant  e  properly,  we  can  identify  F  with  the  total  charge  of  the  field,  i.e. 

Q=e{n^-n_).  (114) 

Calculating  the  corresponding  vectorial  quantity  /^ ,  making  use  of  the  Lagrangian 
for  the  scalar  field,  we  have 


cp^  8(p 


dx^  dx^ 


(115) 


/.=  - 

which  we  identify  as  the  current. 

The  spinor  field 

In  a  similar  way  we  calculate  /^  for  a  spinor  field 

which  is  the  cm-rent  for  Dirac  field. 

EPCR       11 
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Thus  imposing  a  phase  transformation  9?  ->  6*^99  leads  to  the  existence  of  charge 
and  current  and  invariance  under  the  transformation  assures  the  continuity 
equation.  Hence  this  gauge  is  more  appropriately  termed  the  "charge  gauge". 

The  electromagnetic  field^ 

We  know  from  classical  electromagnetic  theory  that  the  electromagnetic 
field  equations  are  invariant  under  the  gauge  transformations  (first  kind) 

A,(x)^A',{x)  =  A^(x)+^^.  (117) 

X  {x)  is  arbitrary  to  the  extent  that  it  satisfies 

Ux(x)-=0.  (118) 

This  transformation  is  a  gauge  transformation  of  the  first  kind  as  distinct  from 
the  gauge  transformation  of  the  second  kind  defined  by  extending  the  charge 
gauge,  i.e. 

cp{x)-^(p'{x)  =  e''^^''^(p(x).  (119) 

We  find  a  remarkable  feature  now,  that  on  demanding  invariance  of  the  total 
free  Lagrangian  density  L  under  both  the  gauge  transformations,  we  are  led  to  a 
unique  interaction  of  the  charged  fields  with  the  elctromagnetic  field.  We  first 

notice  that  a  term  of  the  type  j  — —  —  i£■A^^  1 99  stays  invariant  under  both  these 
transformations,  i.e.  ^ 


dx^ 


^''^{-^-isA^icpix).  (120) 


However  remembering  that  for  the  Lagrangian  of  the  charged  fields  (whose 

field  variables  are  necessarily  complex)  to  be  hermitian,  only  combinations  99^99 

dw^     dw 
or  — —  can  occur,  we  can  easily  convince  ourselves  that  the  exponential 

ox^^     ox^ 

phase  factor  will  always  cancel.  This  suggests  the  prescription  that  in  the  free 

a 

field  Lagrangian  we  should  replace  djdx^^  ^y"^ ie^^and  thereby  obtain  the 

cx^ 

Lagrangian  in  the  presence  of  the  field.  We  shall  indicate  this  procedure  in  the 
case  of  the  scalar  field  (charged), 

d(p^    dcp 


L  = 


a^.  ex,^^'^'^ 


(121) 


1  C.N.Yang  and  R.L.Mills,  Phys.  Rev.  96,  191  (1954);  R.  Utiyama,  Phys.  Rev.  101, 
1957  (1956).  These  papers  deal  with  the  general  gauge  transformations  which  are  space  de- 
pendent. From  this  theory  it  is  possible  to  deduce  the  existence  of  the  electromagnetic  field. 
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Applying  ovir  prescription  and  after  a  little  rearrangement  of  terms,  we  get 

LF  =  L-j^A^,  +  8^Al^^cp  (122) 


where 


.    ^  d(p  'd(p'  \ 


The  last  two  terms  represent  the  interaction  Lagrangian  L{^^(.  (Figs.  23  a  and  23  b.) 
It  is  interesting  to  verify  the  interaction  of  the  electron  with  an  external 
electromagnetic  field  by  this  method. 


(a) 
Fig.  23.  (a)  Current  term; 


(b)  "Catastrophic"  term. 


The  spin  operator 

We  shall  now  discuss  continuous  transformation  associated  with  space-time. 
We  recall  the  intrinsic  variation  of  the  fields  under  space-time  rotations.  An 
infinitesimal  rotation  of  space-time  is  a  particular  case  of  the  Lorentz  transfor- 
mation. The  change  in  the  fields  due  to  this  transformation  can  be  split  into  two 
parts,  the  change  connected  with  coordinate  dependence  and  that  connected 
with  an  intrinsic  variation.  We  have  already  argued  that  S  is  the  matrix  for 
intrinsic  variation.  Interpreting  it  as  a  gauge  transformation  we  will  obtain  the 
spin  operator. 

Consider  a  change  in  the  field  at  the  same  physical  point  P  under  a  space-time 
rotation.  The  coordinates  x^^  in  the  new  system  after  rotation  are  related  to  the 
coordinates  Xf^  in  the  original  system  by 

4=- 2(^0  + ^>.^)^,'  ■      (123) 

V 

where  e^,^  is  the  Lorentz  transformation  matrix. 

However  for  a  non- scalar  field  with  more  than  one  component  we  can  write 

The  change  in  the  functional  form  is  obtained  by  not  merely  writing  x  in  terms 
of  x'  but  also  by  operating  S^^.  If  we  now  define 


^ocB    —  ^cxB  +   2  2j<xP^fivJ 


(125) 


where^J^  is  an  operator  defined  by  the  above  equations,  then  considering  the 
intrinsic  variation  alone,  the  new  q?  can  be  written  as 

<p'A^)  =  2  (4^  +  *  Si}  ^,.)  ni^)  ■  (126) 

0 
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From  (108)  the  corresponding  observable  is 

^^v  =  /:^«(^)i;^?9'^Wd3^.  (127) 

We  will  now  evaluate  5^^  for  various  fields.  We  have  shown  that  under  a  Lorentz 
transformation  y){x)  becomes  ^'(a;')  with  ^{x^)  =  Sip{x)  and  the  condition  for 
covariance  is 

S-'y,S  =  2%rY,-  (128) 

V 

Substituting 

S=l  +  i2J''''^^.  (129) 

we  find 

Using  the  explicit  representations  of  y^,  s, 

U^"==-W^x,     where     ju,  v,  ^  =  I,  2,  3.  (131) 

Now  . 

s^,=  ily^H^)o,,V{^)d'x,  (132) 

which  becomes  in  the  non-relativistic  limit  (only  large  components  of  ip) , 

s=  -I  Jip^x)<Jy){x)d^x.  (133) 

Thus  we  have  obtained  the  well  known  Pauli  representations  of  spin  (j  in  the 
non-relativistic  region. 

Since  by  definition  the  scalar  field  transforms  as 

(p'{x')  =  (p{x), 

we  note  there  is  no  intrinsic  change  for  scalar  or  pseudoscalar  field  and  hence 
their  quanta  are  spinless.  A  vector  field  99^,  transforms  as  the  coordinates  by 
definition,  i.e. 


We  recognize  that 

Setting 

we  obtain 


Sfzv  =  sA^,  A'.  ^     cyclic     1,  2,  3), 

s-  f  {jtx(f)d^x.  (136) 


We   can  identify  5^,  ^g,  ^3   to   be   the   well  known   matrices   for   spin  1    (see 
chapter  II-3).  Thus  the  vector  field  has  spin  1. 

Since  we  have  already  estabhshed  the  correspondence  between  spin  and  iso- 
topic  spin  formahsms,  it  is  clear  that  the  above  considerations  can  be  carried 
over  to  rotations  in  isotopic  spin  space.  Thus  the  nucleon  field  has  two  compo- 
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nents  (2  component  Pauli  spinor)  i.e.  proton  and  neutron  and  hence  its  isotopic 
spin  vector  is 

I=-i  fy)'^ry)d^x,  ^  (137) 

where  t  is  identical  with  a.  In  particular  for  a  rotation  around  the  third  axis 

Iz=-ifw''r^W^^^  (138) 

and  will  be  conserved.  The  nucleon  is  thus  an  isospinor.  Similarly  we  can  treat 
the  meson  field  which  has  three  components  (corresponding  to  two  charged 
particles  and  a  neutral  particle  jz^,  n^)  as  a  vector  and  obtain 

I  =  l{:tx(p)d^x.  (139) 

The  isotopic  spin  accordingly  is  1,  the  yr-meson  is  therefore  an  isovector.  We  are 
already  aware  that  the  importance  of  isotopic  spin  arises  only  when  one  introd- 
uces charge  independence.  Since  charge  independence  of  the  interaction  is 
equivalent  to  invariance  of  the  interaction  under  any  rotation  in  charge  space 
the  interaction  should  be  a  scalar  in  charge  space.  The  scalar  product  of  two 
vectors  is  obviously  an  invariant  and  hence  we  form  the  scalar  product  of  the 
isotopic  spin  vector  and  the  meson  field  which  itself  is  an  isovector.  Thus  the 
charge  independent  Yukawa  interaction  for  a  pseudoscalar  meson  field  is 

or  in  terms  of  the  observable  q)+,  (p_,  cpQ 

=  y2Gy)r_(p_f  +  y2G{pr+(p+y)  +  Gfr^cp^ip  (141) 

which  is  the  Kemmer  Hamiltonian  to  be  discussed  later.  Here 

_  ^1  +  i  9^2  _  9^1  -  ^  9^2 

'^■'~       1/2  ^-~        p       ' 

We  note  that  neutral  mesons  are  oppositely  coupled  to  the  neutron  and  the 
proton,  i.e.  Gp  =  — G^.  From  the  expression  it  is  also  clear  that  the  probability 
for  emission  or  absorption  of  a  charged  meson  is  (|/2)2,  i.e.  twice  the  probability 
for  the  emission  or  absorption  of  a  neutral  meson. 

The  interaction  of  the  charged  particle  with  the  electromagnetic  field  depends 
upon  the  charge  and  hence  on  /g .  The  interaction  is  of  the  type 

where 

?A.==44v^r/.(i  +  '^3)v^]> 

the  term  (1  +  T3)  ensures  that  the  neutron  has  no  interaction  with  the  electro- 
magnetic field.  The  interaction  behaves  like  the  third  component  for  rotations 
in  the  isotopic  spin  space.  Thus  though  the  interaction  remains  invariant  under 
rotations  about  the  third  axis,  i.e.  it  will  conserve  I^  and  hence  the  charge,  it  will 
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not  remain  invariant  under  arbitrary  rotations  in  charge  space.  Thus  the  total 
/-spin  will  not  be  a  good  quantum  number. 

Charge  symmetry  was  originally  introduced  in  nuclear  forces  to  denote  that 
the  nuclear  forces  between  two  neutrons  is  the  same  as  that  between  two  protons. 
The  operation  implied  here  is  changing  a  proton  to  a  neutron  or  vice  versa  and 
charge  symmetry  is  a  consequence  of  in  variance  under  this  operation.  By  rotating 
through  d  =71  about  the  second  axis  in  charge  space  we  can  perform  this  operation 
which  we  will  denote  by  T^{7i).  Because  of  the  identity  in  the  operator  relations 
between  angular  momentum  and  isotopic  spin,  this  operator  for  a  spinor  is  e^'^-'' 
and  for  9?  is  e**''"  where 


0  0 

0  ] 

0  0 

-i 

0  i 

0 

"0  0  r 

000 

«3  = 

_i  0  0 

0 

-pi  0 

pi 

0      0 

0 

0      0 

Lee  and  Yang^  have  defined  an  extended  charge  conjugation,  i.e.  G  operator, 
which  is  a  product  of  charge  symmetry  and  charge  conjugation 


Thus 


and 


G=T^{7i)U, 


O 


V 


G^-i  = 


G(pG~^ 


G  anti-commutes  with  the  heavy  particle  number  operator,  i.e.  \G,  N^^  =  0.  Also 
IGS]^  =  0,  G^=  (— 1)^+'^,  [G,  I]  =  0.  It  can  be  seen  that  G  is  equivalent  to 
reflection  in  isotopic  spin  space  for  pions.  If  interactions  are  invariant  under  G 
then  a  state  with  even  number  of  pions  cannot  go  into  odd  number  of  pions. 


1  T.  D.  Lee  and  C.  N.  Yang,  Nuovo  Cim.  3,  749  (1956). 

2  S.  Weinberg,  Phys.  Rev,  112,  1375  (1958). 


CHAPTER  IV 

PION  PHYSICS 

1.  INTRODUCTION 

The  study  of  quantum  electrodynamics  is  considered  so  satisfactory  that  it 
has  led  to  the  belief  that  it  has  reached  a  state  of  completeness.  This  belief  is 
perhaps  well-grounded  because  of  the  validity  of  the  perturbation  expansion 
and  the  high  accuracy  attainable  by  taking  only  a  few  terms  of  the  expansion 
in  powers  of  e^.  The  chief  characteristic  of  the  electromagnetic  interaction  is 
that  it  is  linear  in  the  photon  field  variable  which  means  that  only  a  single  photon 
can  be  emitted  or  absorbed  at  a  vertex.  The  concept  of  the  basic  interaction 
merely  consists  in  assuming  creation  and  annihilation  of  particles  at  a  vertex,  the 
number  of  such  vertices  being  very  small.  The  question  arises  whether  these 
concepts  can  be  carried  over  to  the  study  of  interactions  of  other  types,  e.g.  the 
Yukawa  interaction.  It  was,  in  fact,  application  of  such  considerations  to  the 
problem  of  nuclear  forces  that  led  Yukawa  to  predict  the  existence  of  the  meson.^ 
But  the  situation  here  is  rendered  difficult  since  (as  it  turns  out  to  be  the  case),  the 
vaUdity  of  the  perturbation  expansion  is  questionable  and  even  granting  it,  the 
number  of  terms  of  the  expansion  to  be  taken  which  will  lead  to  results  in  agreement 
with  experiment  is  not  known. 

The  various  attempts  to  develop  a  meson  theory  can  be  broadly  classified 
as  follows : 

(1 )  The  weak- coupling  approximation  in  which  Feynman's  methods  in  quantum 
electrodynamics  are  applied  to  meson  problems. 

(2)  The  strong  coupling  theory  of  Wentzel,  Heisenberg  and  others. 

(3)  Phenomenological  theories. 

(4)  Other  theoretical  methods  for  the  treatment  of  strong  interactions  of  which 
we  shall  be  considering  only  Chew's  approach  in  detail.  But  before  discussing 
these  various  approaches,  it  would  be  interesting  to  study 

(i)  the  properties  of  the  meson,-  and 

(ii)  certain  striking  features  of  the  meson-nucleon  interaction  in  the  low 
energy  region. 

Properties  of  the  jt-meson 

The  existence  of  a  particle  with  mass  approximately  300  times  that  of  the 
electron  was  postulated  by  Yukawa  in  an  attempt  to  explain  the  nuclear  forces 
of  very  short  range  and  great  strength  as  arising  due  to  the  exchange  of  such 

1  H.  Yukawa,  Proc.  Phys.  Math.  Soc.  Japan  17,  48  (1935). 
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a  particle  between  the  nucleons.  Since  the  proton  is  positively  charged  and  the 
neutron  uncharged  three  types  of  particles  can  be  exchanged,  positively,  neutrally 
and  negatively  charged  particles,  tt^,  ti^  and  n~  respectively.  Their  existence  was 
confirmed  experimentally  in  cosmic  rays^  and  later  they  were  copiously  produced 
in  the  laboratory  in  nucleon-nucleus  colUsions  in  high  energy  accelerators.  The  fact 
that  TT^  and  7i~  are  singly  charged  is  verified  by  reactions  like 

n-  +  p-^n  +  y  (1) 

71+  +  D-^2p. 

Both  the  charged  and  neutral  pions  are  unstable  particles  with  lifetimes  which 
can  be  determined  by  studying  their  decays.  The  charged  pion  decays  into  the 
lighter  particle,  the  ju-meson  and  a  neutrino  or  an  electron  and  a  neutrino 

7i~  ^  jLi~  +  V  (2  a) 

7z;+  ->-  e+  +  r 

71-  ^e-  +  V.  (2b) 

The  ^-mesons  unlike  the  jz  do  not  have  strong  interactions  with  matter  but  decay 
according  to  the  scheme 

ju^^e^  +  V  4-  V.  (3) 

Originally  the  ^-meson  was  mistaken  to  be  the  Yukawa  particle  but  since  it 
was  found  that  it  did  not  interact  with  the  nuclei  of  matter,  further  search  led  to 
thediscovery  of  the  Tr-meson,  through  its  decay  into  the  /^-meson,  in  experiments 
in  cosmic  ray  process.  (2  b)  has  a  branching  ratio  of  (1.2  ±  0.3)  x  10~*  as  com- 
pared to  a  value  -^100  for  process  (2  a).  The  lifetime  of  the  charged  pion  is  there- 
fore determined  by  process  2(a)  and  is  found  to  be  (2.55  di  0.05)  x  10"®  sec. 

The  interaction  which  causes  the  decay  is  "weak"  compared  to  the  electro- 
magnetic and  more  so  with  respect  to  the  "strong"  nuclear  reactions  and  has 
been  the  subject  of  considerable  theoretical  study,  as  we  shall  see  later  (chapter  VII). 
Process  (2  a)  is  understood  as  going  through  two  virtual  processes — a  strong 
process  in  which  the  pion  goes  over  into  a  nucleon-anti-nucleon  pair  and  a  weak 
one  in  which  this  pair  is  connected  to  the  final  system  of  a  ^-meson  and  a  neutrino 
by  a  Fermi  interaction.  On  this  view  point,  the  small  branching  ratio  of  (2  b)  is 
explicable  and  a  satisfactory  quantitative  explanation  of  this  problem  has  been 
given  recently  (see,  chapter  VII). 

The  neutral  pion  decays  very  fast  by  means  of  the  electromagnetic  processes 

7i^->y+y  (4a) 

7Z^->e+ -{- e- +  y.  (4b) 

The  branching  ratios  for  processes  (4a)  and  (4b)  are  98.8  and  1.2  respectively. 
It  is  through  the  observation  of  the  secondary  electron-positron  pair  produced 

1  C.  M.  G.  Lattes  et  al.  Nature  Lond.  160,  453,  486  (1947). 
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by  the  y-rays  in  reaction  4(a)  or  the  pair  produced  in  reaction  4(b)  that  the  life- 
time of  the  neutral  pion  is  determined.  It  is  estimated  to  be  less  than  4  x  lO^^^sec. 
Here  again  the  decay  process  is  understood  to  go  in  two  stages,  a  strong  reaction 
followed  by  an  electromagnetic  one. 

The  masses  of  7r+  and  n'  have  been  found  to  be  equal,  within  experimental 
errors,  the  value  being  --'273.3  m^  (where  m^  is  the  mass  of  the  electron).  The 
existence  of  71^  is  inferred  from  its  decay  into  two  y-rays  which  can  be  observed 
by  the  electron -positron  pairs  they  produce  as  mentioned  earlier.  The  direct 
determination  of  the  mass  of  n^  is  obviously  difficult.  An  indirect,  but  accurate 
measurement  is  based  on  the  reaction 

n~  +  J) -->- n -^  71^  (5) 

i ^2y 

where  the  mass  difference  between  ti"  and  n^  is  measured.  Neglecting  the  kinetic 
energy  of  tt"  in  comparison  with  the  neutron  proton  mass  difference,  a  simple 
calculation  yields  the  n^  —  7i~  mass  difference  which  is  determined  by  observing  the 
energy  distribution  of  the  y-rays.  This  small  difference  of  about  5  MeV  is  amplified 
by  Doppler  effect  into  a  large  energy  spread  for  the  y-rays  of  about  30  MeV. 
The  spin  of  the  charged  pion  can  be  deduced  from  the  reaction 

^  + ;p<->Z)  +  ^+.  (6) 

By  the  principle  of  detailed  balance  the  forward  and  backward  reactions  differ 
only  by  statistical  weights  and  it  can  be  shown  that 

G  (production  oin^)        Z{2S+\)k'^ 


or  (absorption  oin^)  2K^ 


(7) 


where  k  and  K  are  the  meson  and  proton  momenta  respectively  in  the  cm. 
system  and  S  is  the  spin  of  the  pion.  Experiments  performed  at  the  same  energy 
and  scattering  angle  in  both  cases  give  a  value  S  =  0. 

The  experimentally  observed  fact  that  the  neutral  pion  decays  into  two  photons 
(and  not  three)  immediately  determines  the  spin  of  the  jz^.  If  we  assume  the  tz^ 
to  have  a  spin  equal  to  1,  then  in  its  rest  system,  its  wave  function  must  transform 
like  a  vector  which  determines  the  spin  direction.  The  final  wave  function  of  the 
two  photons  involves  three  vectors  ^i  and  63  the  two  polarizations,  and  k  the 
relative  momentum.  The  wave  function  must  be  bilinear  in  f  ^  and  63  and  sym- 
metric with  respect  to  the  interchange  of  the  two  photons  (since  they  are  bosons). 
Further,  because  of  the  trans versality  condition  we  have  Si  -  k  =  82  •  k  =  0, 
Now  the  only  vectors  bilinear  in  ei  and  s^  that  we  can  construct  are : 

(i)  61  X  62 
(ii)  (fi-62)fe 
(iii)  fe  X  (61  X  62)  =  fi(fe  •  62)  —  82{k-  Si)  (7a) 

(i)  and  (ii)  are  excluded  since  both  change  sign  when  the  photons  are  interchanged 
and  (iii)  vanishes  identically  because  of  the  transversality  condition.  Thus  we  find 
that  a  particle  with  spin  1  can  never  decay  into  two  photons. 
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If,  on  the  other  hand,  the  n^  has  zero  spin,  its  wave  function  is  scalar  or  pseudo- 
scalar  depending  on  its  parity.  We  can  now  form  two  possible  final  wave  functions 

(iv)  61  •  82 

(8) 
(v)  (f  1  X  f  2)  •  fe 

This  decay  is  thus  possible  and  if  we  exclude  spins  greater  than  one  (which  is 
also  necessitated  on  grounds  of  renormalizabilitj^)  we  arrive  at  the  conclusion 
that  the  71^  has  zero  spin. 

Under  a  space  reflection  k  will  change  sign.  Hence  (iv)  is  a  scalar  and  (v)  is 
a  pseudoscalar.  This  shows  that  the  decay  photons  of  a  scalar  meson  have  parallel 
polarizations  while  a  pseudoscalar  pion  decays  into  photons  which  are  perpendi- 
cularly polarized.  Thus  a  determination  of  the  polarization  of  the  decay  photons 
determines  the  intrinsic  parity  of  n^  which  is  found  to  be  odd  (with  respect  to 
thenucleon). 

The  invariance  of  the  interaction  of  the  pion  with  nucleons  under  the  parity 
operation  (space  reflection)  can  be  made  to  yield  the  parity  of  the  71: "-meson  with 
respect  to  the  nucleon  (the  parity  of  which  is  assumed  to  be  even).  Let  us  consider 
the  process 

7i~  ^  D  ^n  +  n.  (9a) 

We  know  that  the  processes 

71+ ^D^^'p  +  p  (9b) 

have  the  same  matrix  element,  by  the  principle  of  detailed  balance  and  therefore, 
as  before,  the  ratio  of  cross-sections  will  contain  a  weight  factor  (2/S  +  1)  =  1 
since  /S  =  0  for  the  pion.  The  capture  of  n~  takes  place  mainly  from  the  1 S  orbit 
since  the  18  wave  function  of  the  tt"  is  almost  "inside"  the  nucleus.  Thus  the 
initial  system  in  9  (a)  has  J  =  1  and  its  parity  is  the  same  as  that  of  the  relative 
parity  of  7t~  with  respect  to  the  nucleons.  The  only  state  of  the  two  neutrons 
with  J  =  1  (which  obeys  the  Pauli  principle)  is  the  SP^  state  which  has  odd 
parity.  Assuming  that  n  and  p  have  even  relative  parity,  we  can  deduce  from 
parity  conservation  that  7r~-meson  has  odd  (intrinsic)  parity.  By  charge  in- 
dependence, 71'+  and  71^  should  also  have  odd  intrinsic  parity. 

The  resonances  in  the  low  energy  pion-nucleon  interaction 

We  next  consider  the  most  striking  feature  of  the  pion-nucleon  interaction  as 
well  as  that  of  the  interaction  of  the  electromagnetic  field  with  the  pion-nucleon 
system,  viz.  the  occurrence  of  resonances  in  the  total  cross-sections  for  the 
scattering  of  pions  by  nucleons  and  the  photo  production  of  mesons  from  nucleons. 
The  interaction  of  charged  mesons  with  protons  can  proceed  in  the  following  ways 

71+  +  p -^  71+  +  p  (I) 

71-  +  p -^  71-  -\-  p        (II) 

71-  +  p^Ti^  +  n     (III) 

71- +  p-^n-^y       (IV)  (10) 
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We  shall  not  deal  with  (IV)  as  its  cross-section  is  known  to  be  very  low.  For 
processes  (I)  to  (III),  experiments  show  that  in  the  region  around  100  MeV 
(kinetic  energy  of  the  pion  in  the  c.m.s.)  there  is  a  rapid  rise  in  the  total  cross- 
sections  culminating  in  a  dominant  peak  at  about  200  MeV,  indicating  a 
resonance.  The  association  of  a  33  state  (where  the  Fermi  notation  has  been 
used,  the  first  number  referring  to  twice  the  total  isotopic  spin  and  the  second 
to  twice  the  total  angular  momentum  of  the  state)  with  this  resonance  can  be 
argued  out  on  the  basis  of  the  odd  intrinsic  parity  of  the  pion  and  the  requirements 
of  charge  independence.  Recent  experiments  on  the  scattering  of  jr~  and  jr+  by 
hydrogen  at  energies  from  400  to  1200  MeV  reveal  (see  Fig.  24)  that  the  7e+  cross - 
section  falls  off  above  the  33  resonance  and  has  a  minimum  at  about  650  MeV 
and  then  steadily  rises i.  The  tz"  cross-section,  on  the  other  hand  shows  two  clear 
peaks  at  600-650  MeV  and  at  about  900  MeV.  This  fact  and  the  large  polarization 
of  the  recoil  protons  observed  at  700  MeV  (which  means  a  strong  interference 
between  states  of  opposite  parity)  lead  one  to  characterize  the  first  of  these  states 
as  a  13  state,  the  parity  of  this  state  being  odd  (a  D-state).  A  further  maximum 
has  also  been  observed  at  about  1.5  BeV  in  the  tl^  cross -section.  The  differential 
elastic  coss-section  throghout  the  energy  range  considered  is  sjmametric  about 
6  =  90°,  having  peaks  in  both  the  forward  and  backward  directions. 

The  data  on  the  two  photo-production  processes 

in'^  +  n      .  (v) 

71^  +  p  (vi) 
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Fig.  24.  Total  cross-section  for  the  scattering  and  photo  production  of  pions. 


1  R.  F.  Peierls,  Phys.  Rev.  118,  325  (1960). 
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are  given  in  Fig.  (24).  In  addition  to  the  (33)  resonance,  there  is  a  second  maximum 
at  an  energy  somewhat  below  that  corresponding  to  the  peak  in  7t~  scattering 
and  apparently  rising  towards  a  third  peak  at  1100  Me V  photon  energy.  The 
total  cross-section  for  production  of  n^  has  a  small  second  peak  at  an  energy 
corresponding  to  the  scattering  peak  and  another  small  peak  at  about  1100  MeV. 
The  above  are  only  approximate  representations  of  the  experimental  data, 
some  of  which,  particularly  at  higher  energies,  have  poor  statistical  accuracy. 
Thus  we  can  roughly  summarize  the  different  resonant  states  (considered  as  the 
excited  states  of  the  nucleon)  in  the  following  table. 


Energy  (MeV) 

/ 

J 

P  (=  parity) 

1300 

'1. 

V. 

? 

900 

V2 

V2 

? 

600 

'U 

V2 

- 

200 

V2 

'U 

+ 

0 

V2 

V2 

+ 

As  we  shall  see  presently  there  is  a  good  theoretical  explanation  for  the  (33) 
resonance,  the  shape  and  position  of  which  can  be  fitted  with  the  experimental 
data  by  the  introduction  of  two  parameters.  The  same  can  hardly  be  said  of  the 
other  resonances.  Some  attempts  have  been  made  to  explain  the  second  and  third 
resonances  on  the  basis  of  the  existence  of  resonances  in  the  pion-pion  inter- 
action ^  (see  below).  We  shall  be  considering  only  the  (33)  resonance. 

2.  EARLIER  ATTEMPTS  AND  THEORIES 

Weak  coupling  theory  ^ 

This  theory  is  a  simple  minded  extension  of  Feynman's  methods  which  lead 
to  such  excellent  experimental  agreement  in  electromagnetic  processes,  to  the 
interaction  of  the  mesons  with  the  nucleons.  As  mentioned  earlier  because  of  the 
largeness  of  the  coupling  constant  (in  this  case  g^  ^  15),  we  do  not  expect  to  get 
results  agreeing  with  the  experiments.  The  lowest  order  Feynman  diagrams  for 
the  scattering  processes  (I)  to  (III)  (equation  (10),  sec.  1)  are  shown  in  Fig.  (25). 
It  is  readily  seen  that  since  a  proton  can  emit  but  not  absorb  a  71+  and  similarly 
a  neutron  can  absorb  but  not  emit  a  ji;+,  only  diagram  25(a)  contributes  to  the 
process  (I)  (equation  (10),  sec.  1)  and  only  diagram  25(b)  to  the  process  (II). 
Both  the  diagrams  contribute  to  the  charge -exchange  scattering  process  (III). 
Evaluating  the  matrix  elements  in  the  usual  way,  squaring,  averaging  over  the 
spin  of  the  initial  nucleon  and  summing  over  the  spin  of  the  final  nucleon,  the 
differential  cross-section  in  the  laboratory  system  can  be  found.  We  give  the 


^  See  e.g.  K.  Itabashi  et  al.,  I.N.S.  Report  14. 

2  See  R.  E.  Marshak,  Meson  Physics,  McGraw-Hill,  New  York  (1952). 
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results  for  only  the  total  cross- section  which  can  be  obtained  by  integrating  the 
differential  cross-sections.  For  the  extreme  relativistic  case  and  for  pseudoscalar 
coupling  between  the  meson  and  the  nucleon  we  get 

28, 


Gi=- log — ^ 
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Fig.  25.  Meson-nucleon  scattering. 
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For  pseudovector  coupling  the  expressions  are 

n{  —     me, 
nf'me. 


"■-''^11 


On 


OllJ 


(2) 


Here  g  and  /  are  the  pseudoscalar  and  pseudovector  coupling  constants  of  the 
charged  pion  with  the  nucleon  g^  and  gp,  and/^  and  /^  are  the  coupling  constants 
of  the  71^  with  the  neutron  and  proton  respectively,  ju  is  the  mass  and  e,  the  energy 
of  the  incident  pion  and  m  the  nucleon  mass.  The  pseudoscalar  coupling  gives 
an  essentially  constant  cross- section  as  the  energy  of  the  pion  increases  while 
the  pseudovector  coupling  gives  a  continuous  increase  of  cross-section  with  energy, 
both  of  which  are  in  contradiction  with  experiment. 

The  ratio  between  the  7i;+  and  jr~  direct  scattering  is  about  1.5  for  ps  coupling 
and  1  for  pv  coupling  at  50-100  MeV.  Experimentally,  the  ratio  is  about 
5  at  60  MeV.  Further  the  ratio  of  the  charge  exchange  to  direct  scattering  is  not 
correctly  predicted  by  the  weak  coupling  theory  for  either  coupling. 
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Turning  now  to  the  application  of  the  weak  coupling  theory  to  the  photo 
production  of  a  meson  from  a  nucleon,  we  note  that  the  following  types  of  diagrams- 
enter  the  calculation : 


> 


(b) 


(c) 
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(f) 
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Fig.  26.  Photo  production  of  pions. 
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Figs.  26  (a)  to  (h),  contain  the  relevant  diagrams  for  the  production  of  7r+,  7t~  and  71^ 
as  indicated.  (The  anomalous  magnetic  moments  of  the  nucleons  are  neglected.) 
For  obvious  reasons,  Figs,  (a),  (d),  (g)  and  (h)  may  be  called  "shaking  off"  transi- 
tions, Figs,  (b)  and  (e)  "photoelectric"  transitions  and  Figs,  (c)  and  (f)  "cata- 
strophic" transitions.  It  may  be  noticed  that  neither  the  "photoelectric"  nor  the 
"catastrophic"  transition  is  present  in  the  case  of  tt^  production  since  it  is  neutral. 
At  moderate  energies  the  two  contributions  from  the  two  diagrams  (g)  and  (h) 
just  about  cancel  each  other  so  that  we  should  expect  a  small  cross-section  for  n^ 
production.  Also  n^  should  not  be  produced  from  neutrons.  Both  these  results 
disagree  with  experiment.  At  about  300  MeV  the  cross-sections  for  neutral  and 
charged  mesons  are  of  the  same  order  of  magnitude  (Fig.  24),  the  peak  for  n^ 
being  in  fact  slightly  higher.  Kaplon^  tried  to  improve  this  state  of  affairs  by 
including  the  interaction  of  the  anomalous  magnetic  moment  of  the  nucleon  with 
the  photon.  He  did  get  a  better  yield  for  71^  but  the  angular  distributions  of  the 
mesons  predicted  by  the  earlier  weak  coupling  theory  as  well  as  Kaplon's 
improved  version  are  found  to  be  in  disagreement  with  experiment.  Thus,  as 
expected  from  the  beginning,  the  weak  coupling  theory  is  not  able  to  give  correct 
results  for  meson  interactions. 

Strong  coupling  theory 

Realizing  the  large  value  of  the  coupling  constant  even  during  the  early  develop- 
ment of  the  meson  theory,  Heisenberg^,  WentzeP  and  others*  assumed  that  many 
virtual  mesons  will  be  closely  associated  with  the  nucleon  (unlike  in  the  weak 
coupling  theory  where  only  a  single  meson  is  so  associated)  and  therefore  it  is 
permissible  to  treat  the  nucleon  classically.  At  that  time,  the  classical  approach 
was  the  only  one  possible  as  the  technique  of  renormalization  and  the  consequent 
solution  of  the  self-energy  problems  were  not  available.  The  only  way  then  to  avoid 
the  difficulties  due  to  divergences  was  to  postulate  a  fixed  extended  source.  Meson 
scattering  was  considered  as  a  small  perturbation  of  the  intrinsic  field  of  the 

nucleon.  The  calculation  of  the  cross-section  using  the  strong  limit  viz.  —  >  /^  with 

1  1       2   r 

the  meson  energy  co  <C  — ,  where  —  =  —  /  |  v{k)\^dk  and  v{k)  is  the  Fourier 
ci'  a       7t  J    -^ 

transform  of  the  source  distribution  function    leads  to  a  resonance  denominator 

1  J  .  . 

in  the  sense  that  at  co  =  £  the  cross-section  becomes  a  maxi- 


'-m*m 


mum.  Here  s  =  — 73^ —  where  /J  is  the  pseudovector  rationalized  unrenormalized 

fr 

1  M.  F.  Kaplon,  Phys.  Rev.  83,  712  (1951). 

2  W.  Heisenberg,  Z.  Phys.  113,  61  (1939). 

=^  G.  Wentzel,  Helv.  Phys.  Acta  13,  269  (1940). 

*  W.  Paitli,  Meson  Theory  of  Nuclear  Forces,  Interscience  Publishers,  New  York  (1946). 
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coupling  constant.  Thus  the  strong  coupHng  theory  predicts  the  characteristic  reso- 
nance in  the  meson-nucleon  scattering.  But  the  most  important  feature  of  the  theory 
is  the  prediction  of  excited  or  "isobaric"  states  of  the  nucleon  the  first  of  which 
will  be  the  /  =  J  =  ^/g  state.  The  ground  state  i.e.  the  state  of  the  single  nucleon 
has  J  =  ^/a-  The  excitation  energy  is 

^=2£  (J +  =■/,)  (J -1/2).  (3) 

Now  if  the  isobar  is  to  be  stable  against  the  emission  of  a  meson,  for  the  J  =  2/2 
state  we  should  have  6e  <^  whereas  for  resonance  £  >  ^a.  Thus  the  existence 
of  a  resonance  rules  out  the  possibility  of  a  bound  (stable)  isobaric  state. 

Though  the  strong  coupling  theory  predicts  one  of  the  most  striking  features 
of  low  energy  pion-nucleon  interaction  it  can  be  objected  to  on  the  grounds 
that  (i)  "the  treatment  of  the  meson  cloud  classically  is  not  permissible  since  the 
number  of  mesons  of  moderate  momentum  is  really  not  large  (because  of  the 
small  value  of  the  pseudovector  coupling  constant  /^  ^  0.1)  (ii)  the  absence  of 
renormalization,  which  eliminates  self  energy  effects  which  in  turn  play  a  decisive 
role  in  the  strong  coupling  theory  is  a  serious  defect  and  (iii)  the  theory  cannot  handle 
problems  of  the  production  of  additional  mesons.  We  shall  later  see  that  Chew's 
theory  which  also  uses  the  concept  of  a  fixed  extended  source  meets  all  these 
objections. 

Phenomenological  theories 

It  has  been  emphasized  that  perturbation  methods  are  inapplicable  to  inter- 
actions involving  strong  forces  which  exist  in  the  case  of  the  meson-nucleon 
system.  Therefore,  phenomenological  theories  were  attempted  to  explain  the  main 
features  of  meson-nucleon  interactions.  A  particularly  successful  one  is  Brueckner's 
theory  of  meson-nucleon  scattering  and  photo-meson  production  which  satis- 
factorily explains  the  main  features  at  low  energies.^ 

Brueckner  made  the  important  assumption  that  the  scattering  amplitude 
depends  only  on  the  total  isotopic  spin  I  of  the  meson-nucleon  system  and  not 
on  /g.  On  this  assumption  the  scattering  process  is  completely  described  by  the 
two  amplitudes  a  (|)  and  a{\)  corresponding  to  the  two  isotopic  spin  states  of 
the  meson-nucleon  system,  namely,  /  =  ^/g  and  /  =  ^/g.  Of  course,  the  scattering 
depends  also  on  the  angular  momentum  quantum  number  J  which  we  shall 
consider  a  little  later.  For  the  present,  we  shall  analyse  the  dependence  of  scat- 
tering on  the  isotopic  spin  state  of  the  system. 

In  any  process  in  which  the  scattering  or  the  collision  cross-section  depends 
only  on  the  isotopic  spin  states,  the  general  procedure  for  calculating  the  cross- 
section  is  as  follows.  The  initial  system  of  particles  is  to  be  expressed  as  a  linear 
combination  of  definite  isotopic  spin  states  since  the  amplitudes  of  the  scattering 


1  K.  A.Brueckner,  Phys.  Rev.  80,  106  (1952);   also   "Mesons  and  fields"   Bethe  and 
DB  HoPFMANN,  see  Vol.  II,  Row  Peterson  (1955). 
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are  known  for  the  isotopic  spin  states.  This  linear  combination  of  isotopic  spin 
states  is  re-expressed  in  terms  of  the  possible  final  systems  of  particles.  This 
final  system  of  particles  may  in  general  be  different  from  the  incident  system. 
Thus  the  amplitude  of  scattering  for  any  particular  process  is  expressed  in  terms 
of  the  total  isotopic  spin  scattering  amplitudes  (see  equations  4-6). 

The  71+  —  p  system  (^  =  ^/g)  can  exist  only  in  the  /  =  ^/g  state ;  the  correspond- 
ing normalized  wave  function  is  given  by  rjp7i+.  However  the  7i~  +  p  or  n^  +  n 
system  for  which  I^  =  — ^/g  can  exist  both  in  the  /  =  ^/^  and  /  =  ^/g  states. 

vn^=fU'pei2)+ru<^{'k)-  (4) 

Conversely  the  two  isotopic  spin  states  ^(^/a)  and  0(J)  can  be  expressed  as 
linear  combinations  of  the  wave  functions  of  {p  +  7t~)  and  {n  +  ti^)  systems. 

^^{^U)-fUr]p^~-fl3Vn^  (5) 

Let  a  (3/2)  and  a  (^/g)  be  the  scattering  amplitudes  for  the  two  isotopic  spin  states 
/  =  ^/g  and  /  =  1/2.  Then  the  amplitudes  for  the  processes  (I),  (II)  and  (III) 
(section  1  equation  10)  are  given  by 

pJi^'-^pTi'-^lsai^/^) 

p7i-->p7z-^U[a{^l^)  +  2a{^l^)]  (6) 

pn-^n7z'^UfU[aei,)~a{^/,)]. 

We  shall  now  consider  the  cross-sections  of  the  above  processes  under  three 
particular  assumptions. 

(a)  Let  a{^U)^0,           a  (^/g)  +  0 
Then  oj  :  an  '•  cfiii  =  9:1:2                                                (7a) 

(b)  Let  a(V2)  =  «e/2) 
Then  Gj  :  On  ■  oui  =1:1:0                                                  (7b) 

(c)  Let  a(V2)+0,          ^(^/a)  =  0 
Then 


Gj  :  Gji :  Gin  =  0:2:1 


(7  c) 


The  experimental  values  at  120  MeV  and  144  MeV  closely  conform  to  the  ratio 
9  :  1  :  2.1  Thus  we  are  led  to  the  conclusion  that  at  least  in  the  100-150  MeV 
range,  the  scattering  takes  place  predominantly  in  the  /  =  ^/g  state. 

For  a  complete  analysis  of  scattering  we  have  to  consider,  besides  the  isotopic 
spin  states,  the  angular  momentum  states  of  the  pion-nucleon  system.  In  the  low 


1  H.  L.  Anderson  et  al.,  Phys.  Rev.  91,  155  (1953);  J.  Orbar  et  al.,  Phys.  Rev.  95,  624 
(1954). 
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energy  region,  it  is  reasonable  to  suppose  that  the  mesons  are  scattered  mainly 
in  the  S  and  P  states.  Since  there  are  two  isotopic  spin  states  /  =  ^/g  and  /  =-  ^/g 
and  three  angular  momentum  states  S^/^,  P1/2  and  P^j^  for  the  pion-nucleon 
system,  the  low  energy  scattering  can  be  completely  described  in  terms  of  six 
scattering  amplitudes  or  alternatively  by  six  phase  shifts.  We  shall  denote  the 
S-wsive  phase  shifts  for  /  ^  ^j^  and  /  =  i/g  states  by  d^  and  6^  and  the  'p  wave 
phase-shifts  by  d^^,  d^^,  di^,  and  d^^. 

In  the  case  of  scattering  of  7r+  by  protons,  only  the  isotopic  spin  state 
/  =  ^/g  is  involved.  The  total  cross-section  increases  with  energy  and  shows  a 
resonance  at  about  200  MeV.  This  indicates  that  one  angular  momentum  state 
predominates  over  all  the  other  states.  The  measured  angular  distribution  of  jr+ 
which  can  be  expressed  in  the  form 

A  ^ 

A  +  Bcosd  +  Ccos^d  (8) 


dQ 


has  been  found  to  have  a  large  term  proportional  to  cos^  6.  In  order  to  obtain 
such  a  i;erm  from  a  single  state  J  we  must  have  ^  ^  1 .  In  this  case  only  P^j^  is 
such  a  state.  The  other  states  also  contribute  though  not  to  the  same  extent. 
The  cos  6  term  indicates  an  interference  of  S-  and  P- waves. 

The  differential  cross-sections  for  the  three  processes  involving  scattering  by 
protons  can  be  written  as 


do 

do 
dQ 
da 
dQ 


(71+  -^71+)  =  A^  +  B^  COS0  +  C^  cos2(9 
{7Z~  ^-71")  =  A_  +  B_  COS0  +  C_  cos^^ 
{ti-^tjP)  =Aq  +  Bq  GOsd+  Co  GOS^d.  (9) 


A,  B,  C  can  be  expressed  in  terms  of  the  phase  shifts  d^,  d^,  ^33 ,  ^3^ ,  ^^3 ,  and  d^ . 
From  the  phase  shift  analysis,  Fermi ^  noticed  that  the  absolute  sign  of  the  phase 
shifts  is  not  fixed.  What  is  determined  is  only  the  relative  sign.  If  the  signs  of 
all  the  phase  shifts  were  to  be  reversed,  even  then  we  get  the  computed  cross- 
section.  The  absolute  sign  of  the  phase  shifts  can  be  determiaed  from  very  low 
energy  scattering  data  since  in  this  region,  there  is  an  interference  of  nuclear 
scattering  with  Coulomb  scattering,  the  absolute  sign  of  which  is  known. ^ 

Besides  the  ambiguity  of  the  sign  of  the  phase  shifts,  there  exists  another 
ambiguity  which  was  first  observed  by  Yang.^  This  corresponds  to  the  reversal 
of  the  sign  of  (^3^  —  S^^)  and  also  fits  the  experimental  data  equally  well.  We  give 


1  E.  Fermi  et  at.,  Phys.  Rev.  95,  1581  (1954). 

2  J.  Orear,  Phys.  Rev.  96,  1417  (1954). 

^  C.N.Yang  (Private  communication  to  E.Fermi),  see  Bethe  et  al.,  loc.  cit.,  p.  72. 
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below  the  Fermi  and  Yang  solutions  of  the  meson-proton  scattering  phase  shifts 
in  degrees. 

Fermi  solution 


Energy 

S 

P 

/2 

P^k 

MeV 

/  = 

^3 

V2 
^1 

(53  3 

V2 

<5l3 

V2         V2 

^31            ^H 

120 
135 

-15.2 
-14.0 

9.0 
10.3 

29.6 
37.9 

3.9 
2.0 

1.8     -2.8 
5.4     -4.6 

Yang  solution 

120 
135 

-15.4 
-14.2 

9.1 
10.4 

12.9 
17.2 

-1.4 
-2.9 

38.6         3.8 
49.3         5.6 

The  Fermi  solution  gives  a  large  phase  shift  for  633  which  as  we  have  seen  earlier^ 
is  responsible  for  the  resonance  inn'^  —  p  scattering.  On  the  other  hand  the  Yang 
solution  gives  a  large  scattering  in  the  P^j^  state  and  there  is  no  way  of  explaining 
such  a  large  phase  shift  for  d^^ .  The  Yang  solution  is  less  reasonable  and  can  be 
ruled  out  experimentally  by  measuring  the  polarization  of  the  recoil  nucleon. 
The  Fermi- Yang  ambiguity  can  also  be  resolved  theoretically  by  the  application  of 
dispersion  relations.^  The  pion-proton  scattering  cross-section  is  of  the  form 
{A  +  B  cos  6)^  +  C  sin  ^6  where  the  sin  ^6  term  corresponds  to  spin-flip,  while 
the  other  term  corresponds  to  no-spin  flip.  For  forward  scattering,  sin^  6^0 
and  the  dispersion  relation  will  have  no  term  corresponding  to  spin-flip.  If  we 
differentiate  the  dispersion  relation  with  respect  to  the  scattering  angle  to  obtain 
the  rate  of  change  of  amplitude  in  the  forward  direction,  then  it  is  sensitive  to 
spin-flip.  Further,  the  derivative  dispersion  relation  will  not  involve  /S-wave 
phase  shifts  so  that  the  scattering  amplitude  should  be  a  function  of  only  the 
energy  and  the  P-wave  phase-shifts.  It  can  be  written  as  a  term  proportional 
to  /2  plus  a  rapidly  convergent  integral.  In  a  graphical  representation,  while  the 
Fermi  phase-shifts  form  a  linear  plot  (as  required  by  theory)  and  intersect  the 
axis  at  a  point  giving  a  value  /^  =  0.1  for  the  pseudo vector  coupling  constant 
(see  next  section)  which  is  in  excellent  agreement  with  the  value  obtained  by 
other  methods,  the  Yang  phase  shifts  lead  to  an  erratic  behaviour  with  a  negative 
value  for  p.  Thus  the  application  of  derivative  dispersion  relations  rules  out  quite 
conclusively  the  Yang  set  of  phase-shifts.^ 


1  W.  C.  Davidon  and  M.  L.  Goldberger,  Phys.  Rev.  104,  1119  (1956).  See  also  chapter  V. 

2  MiNAMi  {Progr.  Theor.  Phys. 11,  213(1954),  found  a  further  ambiguity  in  assigning  phase 
shifts  which  can  be  stated  as  follows :  If  for  a  given  /  and  J  of  the  pion-nucleon  state,  we 
interchange  the  phase  shifts  for  I  =  J  +  ^  and  V  =  J  —  ^,  the  differential  cross-sections  are 
unaltered  which  means  that  the  dominance  of  the  P3/2  state  may  well  be  the  dominance  of 
the  2)3/2  state.  Dispersion-theoretic  considerations  rule  out  the  Minami  set  of  phase  shifts. 
also. 
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Fig.  27.  Fermi(F)  vs.  Yang(Y)  solution— application  of  dispersion  relations. 


3.  OTHER  THEORIES  NOT  INVOLVING  EXPANSION  IN 
TERMS  OF  A  COUPLING  CONSTANT 

We  now  come  to  a  consideration  of  certain  approximation  methods  based  on 
quantum  mechanical  or  field  theoretic  considerations  which  do  not  involve 
expansion  in  terms  of  a  coupling  constant.  We  shall  be  considering  the  Chew- 
Low  formalism  in  detail,  but  before  doing  this  we  shall  look  briefly  into  two  other 
methods  which  have  been  used,  namely,  Heitler's  theory  of  radiation  damping^ 
and  the  Tamm-Dancoff  method  ^  both  of  which  can*  be  considered  generalizations 
of  the  wave-mechanical  theory  of  level-widths  developed  by  Weisskopf  and 
Wigner^. 

1 .  Heitler's  theory  of  radiation,  damping 

We  have  already  given  a  derivation  of  Heitler's  integral  equation  (chapter  III, 
sec.  1),  which  we  rewrite  here  for  convenience 


<^|^|0> 


iA\K\Oy  +  iji  UA\K\ByiB\T\oy 

B 


(1) 


1  W.  Heitler,  Proc.  Camb.  Phil.  Soc.  37,  291  (1941);  W.  Heitler  and  H.W.Peng, 
ibid.  38,  296  (1942).  M.  L.  Goldberger,  Phys.  Rev.  84,  929  (1951).  W.  Fatjli,  Meson  Theory 
of  Nuclear  Forces.  Interscience  Publishers,  New  York  (1946),  Chapter  IV. 

2  I.  Tamm,  J.  Phys.  (U.S.S.R.)  9,  449  (1945).  S.  M.  Dancoff,  Phys.  Rev.  78,  382  (1950). 

3  V.  Weisskopf  and  E.  Wigner,  Z.  Phys.  63,  54  (1930);  65,  18  (1930). 
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where  all  the  matrices  are  defined  only  between  states  on  the  energy  shell.  Also 

<l\K\my  =  yj  <l\Hl:l\my  (2) 

with  the  understanding  that  all  the  intermediate  states  involved  are  off  the  energy 
shell  (which  is  taken  care  of  by  taking  only  the  principal  values  in  the  integrations 
over  the  intermediate  state  energies).  The  convergence  necessary  for  obtaining 
equation  (1)  can  be  achieved  by  giving  an  extension  to  the  target  particle  (nucleon) 
which  however  means  that  one  has  to  lose  the  relativistic  in  variance  of  the  theory 
(as  is  the  case  for  the  Chew-Low  theory).  Heitler  uses  instead  a  "correspondence 
argument"  based  on  analogy  with  quantum  electrodjniamics  which  states  that 
in  the  sum  <^A  \K\By  all  terms  beyond  the  lowest  non- vanishing  term  shall  be 
omitted.  The  motivation  for  this  is  that  the  matrix  elements  of  the  higher  orders 
generally  diverge.  Heitler 's  rule  will  mean  that,  if  for  example,  the  initial  state  |0> 
consists  of  a  single  physical  nucleon  with  no  meson,  the  only  matrix  that  need 
be  taken  in  the  expansion  of  <^B\K\Oy  is  <^B\Hj[\Oy  if  \By  represents  a  state 
containing  a  nucleon  plus  a  meson  and  if  Hj  is  the  usual  Yukawa  type  of  coupling 
(absorption  or  emission  of  a  single  meson  at  a  vertex).  If  on  the  other  hand  both 
the  initial  and  final  states  contain  a  nucleon  and  a  meson  (meson  scattering), 
then  obviously  the  first  order  term  will  be  zero  and  <^B\K  \0y  has  to  be  approxi- 
mated by  the  second  order  term.  It  can  be  proved  that  Heitler 's  procedure  is 
relativisticaUy  invariant.  Further,  for  any  K,  the  unitarity  of  the  /S-matrix  is 
maintained  which  is  an  argument  in  favour  of  the  theory  since  one  does  not 
obtain  obviously  meaningless  cross-sections  which  increase  indefinitely  with 
energy.  The  theory  using  this  approximation  is  called  the  theory  of  radiation 
damping  since  the  inclusion  of  the  second  term  in  equation  (1)  produces  the 
effect  of  true  radiation  damping  in  the  classical  sense  of  reactions  due  to 
real  processes  (e.g.  the  reaction  of  an  emitted  photon  on  an  accelerated  eletr on). 
Omission  of  this  second  (damping)  term  leads  to  the  results  of  the  usual  pertur- 
bation theory. 

Let  us  consider  as  an  application  of  the  theory  the  scattering  of  a  meson  from 
a  nucleon.  Choosing  for  simplicity  the  extreme  non-relativistic  case  (no-recoil 
limit  for  the  nucleon)  and  using  pseudovector  coupling,  we  can  write  the  lowest 
non- vanishing  term  of  <^k\K  |feo>  as: 

<:k\K\koy=^r^o  ko<J'k.  (3) 

Here  kQ  and  k  are  the  initial  and  final  meson  momenta,  ^  and  co  the  meson  mass 
and  energy  respectively  and  /  is  the  pseudovector  coupling  constant.  The  Heitler 
equation  will  be 

<k\T\k,y  =  —^a.k,a.k  +  il^JW    ] 

fdQ'oi'k'O'kik'lTlkoy.  (4) 
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The  solution  of  this  equation  is  found  by  noting  that  the  only  scalars  that  can 
be  formed  from  ko ,  k  and  a  are  ko  •  k  and  a-{koXk).  The  solution  must  therefore 
be  a  linear  combination  of  these.  Actually  the  solution  turns  out  to  be 

(5) 


<fe|T|feo>  = 

{l  +  2ix)kQ'k  +  ia'ko 

l-ix  +  2x^ 

X  k 

where  x  -■ 

^    ,      .  The  cross-section  is 

da 

f   1 

K 

[l  +  4tx^Q0s^e] 

dQ 

4t7t     JLl^ 

JU^CO^ 

{l  +  2x^f  +  x^' 

Por  very 

large 

X,  the  cross- 

section  is 
da 

36 

dQ       kl 


(6a) 


cos^  19  (6b) 


showing  the  characteristic  P-wave  angular  distribution  of  the  scattered  mesons. 
Heitler's  procedure  also  predicts  the  (3.3)  resonance  at  the  correct  value  of  the 
pion  kinetic  energy  (^^200  MeV),  but  gives  too  high  a  cross-section  for  ordinary 
and  charge  exchange  scattering  of  tt".  Further  there  are  cases  in  which  the  neglect 
of  the  higher  order  terms  in  K  is  simply  incorrect.  The  most  prominent  case  is 
in  the  problem  of  electron  scattering  taking  into  account  radiative  corrections 
where  it  has  been  shown  ^  that  the  inclusion  of  virtual  photons  in^  removes  the 
discrepancies  found  in  the  simpler  treatment.  Blatt^  has  shown  that  for  a  few 
simple  problems  the  neglect  of  higher  order  terms  would  appear  to  be  numeri- 
cally unjustified,  since  the  neglected  terms  are  larger  than  those  retained. 

The  Tamm-Dancofl  approximation 

In  this  the  state  vector  of  the  system  considered  is  expanded  in  terms  of 
complete  sets  of  "bare"  particle  states  which  are  eigenfunctions  of  the  free  field 
Hamiltonian,  the  expansion  coefficients  being  interpreted  as  the  probability 
for  finding  a  definite  number  of  bare  particles  in  the  state,  i.e. 

+  i^JJdH,d^k,c{p;  fei,  k,)aHk,)aHk,)0o  (7) 

where  0q  is  the  free  particle  meson  vacuum,  p  the  nucleon  momentum  and 
a^  (fe)  and  a  (k)  are  the  creation  and  annihilation  operators  of  a  meson  of  momen- 


1  T.  Sebe,  Progr.  Theoret.  Phys.  3,  304  (1948). 

2  J.  B.  Blatt,  Phys.  Rev.  72,  466  (1947). 
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turn  fe.  Writing  the  expansion  in  the  form    yjC^O^  where  11^0^  =  ^n^n  and 
substituting  it  in  the  Sehrodinger  equation      ^ 

^^  ^^*  U  Cn(0m,  (Ho  +  Hi)  0n)  =-  E  JJ  (0,,,   0,)  Cn 

n  n 

=  Em  C,n  +  Z  i^m  ,  Hj  0„)  Cn=EC^  (8) 

n 

or  {E  -  EJ  C^  =  2  {0m,  Hi  0 J  Cn .  (9) 

n 

If  we  consider  Hj  to  be  linear  in  the  meson  field,  this  represents  an  infinite 
system  of  coupled  equations,  a  typical  one  of  which  connects  the  amplitude  for  n 
mesons  with  those  for  tt  —  1  or  %  +  1  mesons.  To  make  a  solution  of  this  set  pos- 
sible, the  number  n  of  the  particles  is  restricted  so  that  the  set  of  equations  is  broken 
off  arbitrarily. 

The  rate  of  convergence  of  the  above  procedure  is  still  an  unsolved  problem, 
the  expectation  being  that  better  approximations  can  be  had  by  taking  an  in- 
creasingly larger  number  of  particles.  The  computation  involved  in  the  solution 
of  the  integral  equations  for  the  simplest  problems  are  prohibitive.^  But  the  chief 
objection  against  the  approximation  is  that  the  equations  are  unrenormaUzable 
in  higher  approximations.  The  approximation  assumes  that  those  components 
of  the  state  vector  containing  more  than  a  certain  number  of  bare  mesons  are 
negligibly  small,  an  approximation  which  is  known  to  be  completely  false  for 
the  experimental  value  of  the  coupling  constant. 

4.  THE  CHEW-LOW  FORMALISM^ 

The  Kemmer  Hamiltonian 

The  Yukawa  interaction  was  discussed  in  the  chapter  on  single  particle  wave 
functions  and  their  equations.  In  that  formulation,  only  the  wave  functions  of 
the  nucleons  occurred  and  U  represented  the  potential  field  of  the  meson.  If  this 
interaction  were  to  be  represented  in  the  field-theoretic  form  we  need  to  include 
the  concept  of  emission  and  absorption  of  charged  and  neutral  mesons  with  the 
creation  and  annihilation  of  the  corresponding  nucleons. 

Kemmer^  postulated  a  symmetric  theory  of  meson-nucleon  interactions  in- 
troducing the  isotopic  spin  formalism  to  include  the  emission  and  absorption  of  all 

1  F.  J.  Dyson  et  al.  Phys.  Rev.  95,  1644  (1954). 

2  This  section  is  essentially  based  upon  Chew's  article  (see  below)  on  pion  scattering  and 
photoproduction.  The  important  papers  and  reviews  on  the  subject  are:  G.  F.  Chew,  Phys. 
Rev.  89,  591  (1953);  94,  1748,  1755  (1954);  95,  1669  (1954);  also  his  article  "Theory  of  Pion 
scattering  and  photo  production"  in  the  Handhuch  der  Physik,  Springer- Verlag,  Vol.  43 
{1959),  which  contains  an  extensive  bibliography  on  the  subject.  G.C.Wick:  Rev.  Mod. 
Phys.  27,  339  (1955);  F.  E.  Low,  Phys.  Rev.  97,  1392  (1955);  also  Rev.  Mod.  Phys.  29,  216 
(1957) ;  G.  F.  Chew  and  F.  E.  Low,  Phys.  Rev.  101,  1570,  1579  (1956). 

3  N.  Kemmer,  Proc.  Camh.  Phil.  Soc.  34,  354  (1938). 
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the  three  types  of  mesons.  The  interaction  Hamiltonian  which  should  be  a  scalar 
in  isotopic  spin  space  was  written  as 

Hj=gipr(\/2r,(p  +  y2r^(p*  +  rs(ps)y).  (1) 

This  corresponds  to  a  proper  combination  of  the  three  distinct  equations  which 
we  wrote  in  the  wave  function  formalism,  the  weights  1/2  being  attached  to  t+ 
and  T_ .  The  choice  of  these  weights  will  be  apparent  if  we  express  t+  and  T-  in 
terms  of  t^  and  T2  in  which  case  the  Hamiltonian  reads : 

^i  =  g^rir^  (^1  +  T2  993  +  "^3  9^3)  ^  •  (2) 

The  meson  variable  99  =  ~77^{(pi  +  i(P2)  when  expressed  in  terms  of  creation  and 

destruction  operators  represents  the  destruction  of  a  71; ~  or  the  creation  of  a  jc^. 

Similarly  cp*  =  -77^  {(pi  —  i(p2)  represents  the  creation  of  a  tt"  or  destruction  of  a  71+ 

while  (^3  represents  the  creation  or  destruction  of  a  jr^.  Though  the  symmetry  of 
the  Hamiltonian  is  apparent  only  when  it  is  written  in  terms  of  9?! ,  9^2 »  9^3 »  physical 
interpretation  is  possible  only  if  it  is  expressed  in  terms  of  99,  99*  and  993. 
We  give  here  the  Feynman  diagrams  corresponding  to  all  the  possible  processes 
obtained  by  operating  (1)  on  a  nucleon  wave  function  (ignoring  the  isotopic  spin 
operator  t^). 

If  9?^,  992  and  993  can  be  considered  as  a  3-vector  in  the  same  charge  space 

3 
as   (ti,  Tg,  T3)  then   the  sum  2j'^i9^i  ^^^  ^®  considered  as  the  scalar  product 

r '  (f.  For  this  to  exist  there  must  be  a  hermitian  operator  T  which  generates 
unitary  transformations  of  99  corresponding  to  infinitesimal  rotations  in  isotopic 
spin  space.  Taking  the  component  T^,  we  have  for  a  real  infinitesimal  e, 

e-^3^^e--^3^^^  +  ^£[T3,  cp]  (3) 

with  two  similar  relations  for  992  and  993.  If  this  transformation  is  to  represent 
a  rotation  by  e  about  the  3 -axis,  we  must  have 

*[^3.9^i]  =  —9^2;  *[^3»9^2]  =  9^1;  *[^3.9^]  =  0.  (4) 

These  conditions  are  satisfied  by  choosing 

T^=jd^r[0,{r)n^(r)-7z,{r)0,{r)]  (5) 

where  71^  is  the  field  momentum  conjugate  to  99^  and  is  connected  with  it  by  the 
commutation  rule 

[niir),<Pi{r')]  =  -iS{r-r')dii.  (6) 

Thus  we  can  write 


1 


T=  fdr{(p 


xjt).  (7) 
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Fig.  28.  Diagrams  representing  the  meson-nucleon  interaction.  For  simplicity,  dia- 
grams involving  anti-nucleons  are  not  shown. 
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The  rotations  of  the  nucleon  isotopic  spin  vector  t  are  generated  by  r/g  itself^ 
because  of  the  commutation  rule  t  x  t  =  2ir.  It  can  be  proved  that  /  =  —  -f 
commutes  with  the  Kemmer  interaction  J^cpiTi.  4  =  —  +  T^  is  the  total  charge 

i  2 

of  the  pion-nucleon  system  and  hence  commutes  with  the  charge -conserving 
Kemmer  interaction.  We  can  show  that  I^  and  ly  also  commute  with  ^j^i'^i- 

i 

Thus  the  Kemmer  interaction  is  invariant  under  transformations  generated  by  / 
so  that  T  •  ^  is  a  true  scalar  product.  /  also  commutes  with  the  kinetic  energy  part 
of  the  HamUtonian  and  is  called  the  total  isotopic  spin  operator.  The  eigenvalue 
of  /2  is  a  constant  of  motion. 

The  Kemmer  interaction  is  the  only  one  of  the  Yukawa  type  (linear  in  the  meson 
field)  which  is  invariant  under  all  rotations  in  charge  space,  i.e.  is  a  true  scalar 
in  charge  space. 

Pion-nucleon  scattering  ^ 

The  complete  interaction  Hamiltonian  density  for  the  interaction  of  a  pseudo- 
scalar  meson  with  the  nucleon  is  given  by  the  expression 

Hi  =  gip(x)y5riip{x)(pi{x)  +  dmip(x)y){x) 

+  iS/x^^i{^)(pd^)  -~^,[(pdx)(Pi{xW-  (8) 

The  first  term  on  the  right  is  the  Kemmer  Hamiltonian.  The  second  and  third 
terms  are  the  counter  terms  for  the  proper  masses  of  the  nucleon  and  the  meson 
respectively.  The  last  term  arises  as  follows :  due  to  the  absence  of  gauge  in  variance 
(since  the  meson  mass  is  different  from  zero)  the  diagram  with  four  external 
meson  lines  leads,  in  contrast  to  quantum  electrodynamics  to  a  divergence.  In 
order  to  compensate  for  this  we  have  to  introduce  a  counter  term  which  is  precisely 
the  last  term.  ?.  is  the  coupling  constant  of  this  direct  pion-pion  interaction  which 
is  expected  to  become  important  at  high  energies.  As  not  much  is  known  about 
this  interaction,  we  can  omit  the  last  term  from  further  consideration.^ 


1  F.  E.  Low,  Phys.  Rev.  97,  1392  (1955). 

^  Recently,  however,  experimental  evidence  has  appeared  which  would  seem  to  point 
to  the  existence  of  a  strong  pion-pion  interaction.  Evidence  for  such  an  interaction  has  been 
found  for  example  in  the  analysis  of  single  pion  production  data  in  7i-p  collisions  by  F.  Bon- 
signori  and  F.  Selleri  {Nuovo  Cim.  15,  465  (I960))  and  by  I.  Derado  {Nuovo  Cim.  15,  853 
(1960)).  Further,  Frazer  and  Fulco  have  shown  that  the  isotopic  vector  part  of  the  nucleon 
form  factors  may  be  satisfactorily  fitted  if  one  assumes  a  n-n  scattering  resonance  in  the 
J  =  1,  /  =  1  state  (W.  R.  Frazer  and  J.  R.  Fulco;  Phys.  Rev.  Lett.  2,  365  (1959);  UCRL 
8880).  Using  the  Mandelstam  representation  (see  next  chapter).  Chew  and  Mandelstam  have 
found,  on  assuming  that  there  is  no  stable  bound  state  of  the  pion-pion  system,  that  the 
limits  for  the  pion-pion  coupling  constant,  X  are  given  by  —  0.36  ^XS  0.3.  (G.  F.  Chew  and 
S.  Mandelstam,  Phys.  Rev.,  119,  467  (I960)).  Recent  calculations  such  as  for  instance  the 
study  of  pion  production  in  pion-nucleon  colHsions  and  photoproduction  of  two  and  three 
mesons  [S.  K.  Srinivasan  and  K.  Venkatesan,  NilcI.  Phys.  (to  be  published)]  seem  to 
indicate  that  A  does  lie  within  this  range. 
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The  iS-matrix  element  for  the  scattering  of  a  meson  of  momentum  q^  and  iso- 
topic  spin  index  oc  to  a  state  of  momentum  of  q^  and  isotopic  spin  index  ^  while 
the  nucleon  changes  its  momentum  from  p^  to  p^  can  be  written  as 

X  (0^„  a,,  ^(^/(^i)  .  . .  Hi{x^))  al  0^J  (9) 

where  all  the  quantities  are  in  the  interaction  representation,  a^  and  aj  are  the 
annihilation  and  creation  operators  of  a  meson  respectively  and  0^  represents  the 
state  containing  a  "bare"  nucleon  of  momentum  p.  Commuting  the  operator  a^ 
to  the  right  and  the  operator  aj^  to  the  left  and  using  the  relevant  commutation 
rules,  we  get 

(-y 

1/4  COj  ft>2 

2  -—J-  /  dx,  dx2  • . .  dxn(0^,,  T{Hj{x^) . . .  Hi{xn)  jpiy)  jAx))  0^J       (10) 

where  co^  and  co^  are  the  energies  of  the  initial  and  final  mesons  and  ji{x)  is  the 
current  operator  in  the  interaction  representation  and  is  given  by 

ji"^  =  gf{x)  y5'^iw{^)  +  Sju^(pi{x).  (11) 

We  now  use  the  general  relationship,  based  on  the  Gell-Mann-Low^  theorem, 
between  the  matrix  elements  in  the  interaction  and  Heisenberg  representations 

2  ^—Y--     dx,...  dx„{0^„  T(Hj{x,)  . .  .  Hi(x„)  j^{y)  jjx)  <P^.), 

n  =  0        'f'  ■       J 

-('P,.>T{i^{y)U{x))'P,y  (12) 

where  all  the  quantities  on  the  right  hand  side  are  in  the  Heisenberg  representation, 
the  ^^'s  being  exact  single  particle  energy-momentum  eigenstates  of  the  total 
Hamiltonian.  (We  demonstrated  such  an  equivalence  in  the  case  of  the  propa- 
gators in  the  two  representations,  in  chapter  III,  section  3.)  Thus  we  can  write 

<P2-^  q^Pl^lPi'^  gi  ^>  -  Y^^  ^  J  dxdy 

^-''^'^'''^'<P2\T{\^{y)Ux))\p,y..  (13) 

Using  the  translational  invariance  theorem  for  the  Heisenberg  operators 

U{x)  =  eiP^liO)e-iP^  (14) 

and  performing  the  a:-integration  first,  we  get  the  right  hand  side  to  be  equal  to 

1  M.  Gell-Mann  and  F.  E.  Low,  Phys.  Rev.  84,  350  (1951). 

2  This  result  can  also  be  arrived  at  using  the  approach  of  Lehmann,  Symanzik  and  Zimmer- 
MANN  (see  chapter  V)  which  however  is  built  on  more  general  premises  and  does  not  involve 
the  concept  of  a  specific  interaction. 


188  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

Expanding  in  terms  of  a  complete  set  of  "incoming"  states  and  doing  the  y-inte 
gration,  we  finally  get 

I 

<P2\u{0)\myim.\i^{0)\p,y 


V.  <y2li^(0)|^><^lu(0)|yi> 

m        E^-'p^Q-co^-ie 


+  V 

m        E„,  -^a)2-Pio-ie 


(16) 


where  the  ''T"  matrix  is  defined  as  the  corresponding  iS-matrix  with  i  times  the 
deltafunction  factor  for  over-all  energy  momentum  left  out.  This  is  the  Low 
equation  for  scattering  which  is  exact  and  the  properties  of  which  we  shall  be 
presently  studying  in  the  fixed  source  limit.  If  we  retain  terms  involving  more  than 
one  meson  in  the  intermediate  states,  this  equation  AviU  become  a  very  complicated 
system  of  coupled  non-linear  integral  equations.  But  if  we  take  the  inelastic 
cross-sections  to  be  small  compared  to  the  elastic  ones,  we  can  make  the  "Tamm- 
DancofiF"  approximation  of  retaining  only  the  intermediate  states  containing  (i) 
a  single  physical  nucleon  and  (ii)  a  physical  nucleon  plus  one  meson.  We  then 
obtain  an  integral  equation  for  the  scattering  amplitude,  the  inhomogeneous 
term  of  which  will  turn  out  to  be,  as  we  shall  see  presently,  the  renormalized 
Born  approximation. 

Next  we  shall  derive  an  expression  for  the  vertex  function  <^p2  |i(^)|  Pi^-  Using 
translational  in  variance,  we  have 

<P2\m\pi>-<P2\io'+i^')<p(^)\pi> 

=  [-  {P2  -  Pif  +  f^'Wp.,  T  (-S  9>(0)),  0J  e*(*.-^.>'^  (17) 
where  the  last  step  is  obtained  by  going  back  to  the  interaction  representation, 
S  being  the  /S-matrix.  If  we  take  the  lowest  order  term  in  S,  namely  —  i  JHj  (ic^)  dx^ 
and  use  the  anticommutation  rules  for  the  spinor  field  variables  and  the  fact 
that  the  vacuum  expectation  value  of  T((p{Xj)(p{0))  is  the  meson  propagator  we 
get  for  the  lowest  order  matrix  element 

[-  (Pz  -  Pif  +  /*'] 9 ruipi)  y, u(p,)  A^Pz  -  Pt)  e^''.-^'^^.  (18) 

The  complete  matrix  element  (17)  is  obtained  by  replacing  Ap  hy  Ap,  y^  by  F^c 
and  g  and  cp  (x)  by  the  corresponding  renormalized  quantities. 
We  can  write  the  vertex  function  as 

<P2\\M\Pl^-[-{V2-Pl?+M^^] 

X  g^  A'p (^2  -  Pi)  u {P2)  r,, (^2 .  Pi)  u (Pi)  r« .  (19) 

As  such  it  is  practically  useless  for  calculation  since  the  matrix  y^  leads  to  a 
coupHng  between  nucleon  and  anti-nucleon  states  much  stronger  than  that 
between  ordinary  nucleon  states  so  that  the  intermediate  states  will  contain  an 
abundance  of  virtual  nucleon-anti-nucleon  pairs  making  calculations  extremely 
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complicated.  If  however  [{A  p)^/m^]  <  1  then  [{A  p)^  +  ju^]Ap^l.  Therefore  in  the 
non-relativistic  region  we  have 

<.»lu(0)b,>  =  iM|^P  (20) 

where  m  is  the  nucleon  mass.  Using  the  equivalence  theorem ^  for  pseudoscalar 
and  pseudo vector  couplings^ 

i  =  /'  (21) 

we  finally  get 

<P2\iA0)\Pi>  =  ifrr,<y'q  (22) 

which  will  represent  an  emission  or  an  absorption  of  a  meson,  fj.  is  the 
renormalized  rationalized  pseudovector  coupling  constant. 

The  presence  of  the  meson  momentum  in  the  matrix  element  indicates  that 
integrals  over  all  virtual  pions  will  diverge.  The  obvious  way  to  remove  this 
divergence  is  to  cut  off  virtual  pions  of  high  momenta.  Chew^  realized  that  this 
could  be  done  by  fixing  the  nucleon  and  assuming  the  interaction  to  be  non- 
local and  effective  within  a  small  region  which  gives  the  "spread"  of  the  nucleon. 
The  larger  the  spread  the  smaller  the  momenta  of  the  virtual  mesons  to  be  taken 
account  of.  With  a  momentum  cut-off  factor  v  {q)  the  vertex  function  now  reads : 

<P2\l{0)\Pi>-ifrr.v{q)a'q.  (23) 

The  right  hand  side  has  to  be  considered  a  matrix  in  the  spin  and  isospin  space 
of  the  nucleon. 
We  notice  that 

<P2\l{0)\p,>=<P^\l{0)\P2>^-  (24) 

Purther  since  we  are  taking  the  static  limit  for  the  nucleon  we  can  take  ^20  =  Pio 
=  the  rest  energy  of  the  nucleon  as  the  ground  state  energy  and  set  it  equal 
to  zero.  We  now  define  the  operator 

U(Q)   ^  ^frraV{q)(y'q 
pcoq  pco, 

and 

T,{m)^<m\V,\py  (26) 

where  \py  denotes  a  single  nucleon  state.  Using  the  notation  ^i(2)  for  the  T 
matrix  for  a  meson  of  momentum  q^  to  go  to  a  state  with  momentum  q^ ,  we  can 
now  write  the  Low  equation  as 


y    =    i«^^^    ^   "Jr'-cc^K'i)^  •  H  ^25) 


TA^)  =  -U 


Tl{m)T,{m)    _^  T\{m)T^{m) 


^m  -  Mo-is  E.^,  +  COs 


(27) 


^  See,  e.g.  Bethe  and  de  Hoffmann,  Mesons  and  fields.  Vol.  I.  Appendix  (Row 
Peterson  (1955)). 

2  Here  and  in  the  following,  we  assume  the  pion  mass  to  be  equal  to  1. 

^  See,  e.g.  G.  F.  Chew,  "Theory  of  pion  scattering  and  photo  production",  Handhuch  der 
Physik  (Springer- Verlag).  Loc.  cit. 
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where  m  now  refers  to  the  meson  degrees  of  freedom  and  the  ^£  has  been  omitted 
in  the  denominator  of  the  second  term  since  it  is  not  a  "resonant"  denominator. 
The  matrix  has  the  following  properties : 

(a)  It  satisfies  the  unitarity  condition 

Tl{l)-T,{2)  =  27tiZTl  (m)  T^ (m)  6 {E,^  -  co) .  (28) 

m 

(b)  It  obeys  the  "crossing"  theorem.  To  show  this  and  for  future  convenience 
we  define  a  hermitian  matrix  function  t^iiz)  of  a  complex  variable  z  =  0)3  +  is: 

t-M^)         Z  ! ^^'"' ^''"^    I    ^' '"'^ ^^'"^  1  (29) 

»i    I      E,„  —  z  E„  +  z      \ 

"^^^  Umt,,{z)  =  T,{2).  (30) 

We  note  that 

tUz*)  =  h,{-z)  (31a) 

or  equivalently 

^12(2)  =  «2l(-2=)  (31b) 

which  is  called  the  "crossing"  theorem  and  expresses  the  fact  that  for  every 
Feynman  diagram  representing  meson  scattering,  there  must  exist  another  which 
is  got  from  the  first  by  exchanging  the  incoming  and  outgoing  meson  lines. 

(c)  ^21(2^)  ^^s  ^  pol^  ^t  ^^6  origin  with  the  residue 

B,,=  Tl  (0)  T,  (0)  -  Tl  (0)  T,  (0)  = -[V„  F,] .  (32) 

which  is  determined  by  the  single  parameter  /^.  Thus  we  see  that  the  inhomogeneous 
term  of  Low's  equation  is  merely  the  renormalized  Born  approximation  as  already 
mentioned. 

(d)  As  2  ->.  oo,  t2i(z)  behaves  as  Ijz. 

(e)  The  other  singularities  of  t^  ^  (z)  are  also  located  on  the  real  axis  and  are 
branch  points  corresponding  toz  =  ±1,  ±2  .  .  .  etc.  with  cuts  to  ±  oo  respectively. 

Chew  and  Low^  have  shown  that  the  above  conditions  completely  determine 
^21(2)  so  that  if  we  could  get  a  function  having  the  above  properties  it  would  be 
a  solution  of  the  Low  equation.  It  is  convenient  to  connect  ^21(2^)  with  the  experi- 
mentally observed  phase  shifts  of  the  four  P-states  of  the  meson-nucleon  system. 

We  can  write 

y4cOia)2       «  =  i 
where  PJs  are  the  projection  operators  for  the  four  states  (33,  31,  13,  11)  which 
can  be  written  down  as  follows.  ^2i(^)  depends  linearly  on  q^  and  gg  ^^^  since 
it  must  be  a  scalar  its  most  general  form  is  a  linear  combination  of,  gg  '  ^1  ^^^ 
<y  •  (g2  X  ^i)-  If  we  now  form  the  operators 

J^l^{2,  1)  =  ^2  •  gi  +  ^  <y  •  (^2  X  qi) 

and  ^3/^(2,  1)  =  2g2  •  ^i  -  ^  ^^  •  (^2  x  ^1)  (34) 

1  G.  F.  Ohew  and  F.  E.  Low,  Phys.  Rev.  101,  1570  (1956). 
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they  can  be  shown  to  project  out  the  states  of  total  angular  momentum  ^/g  and  ^/^ 
respectively.  As  regards  the  isotopic  spin  projection  operators,  we  note  that  if 
we  define  the  matrix 

^QiQ2  =  '^q^'^q,  (35) 

then 

(R\,,,  =  2-<:,,r,x,T,,  =  Z{R),^„  (36) 

a 

""''  R{R-Z)  =  0  (37) 

so  that  R  has  the  eigenvalues  0  or  3.  Further,  for  the  expectation  value  of  R  for 
the  state  \P7z'^y  which  is  a  pure  ^/a  isotopic  spin  state,  we  have,  since 

■     lP7r-^>  =  |/  =  %;      h=-^l2>  =  -^\P{n  +  i^2)y  (38) 

•       iPn'-\R\Pn^y  =  4  <P(99i  -  i  cp^)  \R\{(p^  +  i  cp.^)y 

=  i<P\R,^-  i R^,  +  i R,^  +  R^^\Py 

=  i<P\2-2r,\Py 

=  J[2«P|P>-<P|t3|P»]  =  0  (39) 

Thus  R\I  =  ^l^y  =  0  (40) 

and  therefore 

iJ|/=V2>  =  3|/=V2>-  (41) 

Since  we  can  write 

=  ^...2+4K.Tj  (42) 

we  have  finally  the  projection  operators  for  the  ^/g  and  1/2  states  respectively  as 

^Val^'   ^^  ==  "3  ^Q^Qi  ~  y 2 

and 

/./.(2,  l)  =  45,„.  +  i-i^^.  (43) 

Combining  the  spin  and  isospin  projection  operators  we  finally  get  the  four 
P- state  projection  operators 

P,i(2,  l)  =  /i,2(2,l)Ji,2(2,l) 

Pl3(2,l)  =  /i,2(2,l)J3,2(2,l) 

P3,(2,l)  =  /,,2(2,  1)^1,2(2,1) 

P33(2,1)  =  /8,2(2,1)J3,2(2,1).  (44) 

The  functions  }i^{z)  have  the  same  singularities  as  the  function  ^21(2^)-  Since  there 
is  complete  symmetry  between  the  spin  and  isotopic  spin  angular  momenta  in 
the  cut-off  theory,  we  need  deal  with  only  the  three  functions : 

/^i  =  ^ii;     ^2  =  ^13  =  ^31;     h^hs'  (45) 
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A  comparison  with  the  conventional  expression  for  the  scattering  amplitude  in 
terms  of  the  phase  shifts  shows  that 

-r .     ,    .  .        e^'^asin^^ 


z-^0 


This  is  also  consistent  with  the  unitary  condition  (equation  28)  which  gives  the 
relation 

ImKiz)-^  q^v^q)\  K{w,)\^.  (47) 


The  zero-energy  limit  of  h^  (z)  is  given  by 

2   /2 


hAz) 


3    z 


(48) 


n 


where  /^  =  -f^^  is  the  renormalized,  unrationalized  coupling  constant.  Thus  we 
4:r  ^     ° 

get  the  first  important  prediction  of  the  Yukawa  theory  namely  that  the  four 

P-wave  phase  shifts,  in  the  zero-energy  limit,  depend  on  a  single  parameter  f^. 

To  remove  the  pole  at  the  origin,  we  can  define  a  new  function 

9A^)  =  ^lK(z)]-'. 


It  obeys  the  crossing  relation 


UB^ 


9M  /3 

-where  5  is  a  reflection  matrix;  i.e.  5^  =  1. 
Explicitly 

1 


A-z) 


(49) 


(50) 


B=^ 


9 


1 


2  8" 
7  8 
2  -1 


The  unitarity  condition  on  it  is 


lmgAco,)^--^<fv^{q) 


0)„ 


Also 


Re  g^  {(Oq)  =  — ^  ^  v2  {q)  cot  d^  {co^ 


(51) 


(52) 


(53) 


which  is  important  because  it  gives  the  real  part  of  g^  in  terms  of  the  phase 
shifts  which  are  experimentally  determinable. 

If  the  function  h^{z)  has  no  zeros  in  the  entire  complex  plane,  the  function 
gr^(z)  has  no  singularities  other  than  branch  points  with  cuts  to  ±  oo.  We  can 
then  write 

oo 

COg  —  Z  0)q  +  Z   _ 

1 


qA^ 


--^/ 


d«)„ 


(54) 
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where  the  functions  F^  and  G^  are  determined  by  the  unitarity  condition  and 
crossing  relation  respectively.  Substituting  the  value  for  F^  we  get  finally 


M)  =  1 


7t 


/-•^l 


+ 


HjCOg) 


where 


HJCOg) 


v^q) 


(55) 


(56) 


H^  is  so  chosen  that  g^  satisfies  the  crossing  relation.  Chew  and  Low  got  a  closed 
form  for  the  solution  only  on  simplifying  the  matrix  B  to  the  form 


The  result  is 


B^ 


ro 

0  n 

1 

-1  1 

1 

0  o_ 

COr> 


q^v^q) 


coi 


COn 


+ 


co« 


(57) 


(58a) 


^i(^)=l--K-/' 


^/- 


q^v'^(q) 


(jot 


(Or 


+ 


COa  +  Z 


(58b) 


We  notice  that  gl  (z)  is  not  determined.^  However  these  solutions  are  in  excellent 
I        agreement  with  the  ones  got  from  the  rigorous  dispersion  relations.^ 

The  form  of  the  real  part  of  the  function  g^^iz)  (53)  suggests  by  analogy  to  the 
corresponding  situation  in  nucleon-nucleon  scattering  theory,  that  we  can  try 
an  "effective  range"  treatment  of  the  present  problem.  From  (53)  and  (55) 
we  have 


^g^i.)^Mt(^,,,Sj0. 


(Or 


COr. 


n       J      0)a 


q'vHq)  ^  P 


(0)g   -    CO) 


n  J 


d(Og     q^v^iq) 


ft)2        (oj    +  (O) 


Hjcog)      (59) 


1  Casth^lejo  et  at.  {Phys.  Rev.  101,  453  (1956)  have  pointed  out  that  if  we  add  to  g'^(z) 

c  d 

-i ^r~r  I  where  coq  ,  c^  and  d^  are  constants, 


an  arbitrary  number  of  terms  oi  the  forms  z 


(Dq—  Z         Wq+  z 

the  last  two  satisfying  the  crossing  relation  c^  =  —d^,  c^  =  —d^  tlJe  new  functions  will  satisfy 
the  one-meson  Low  equation.  It  can  be  shown  that  the  solutions  given  above  without  these 
extra  terms  are  unique  in  that  they  correspond  to  the  analytic  continuation  of  the  power 
series  in  p  for  the  functions  h'^{z)  or  g'cc{z). 

^  The  solutions  are  also  in  agreement  with  the  numerical  solution  of  G.  Salzman  and 
¥.  Salzman,  Phys.  Rev.  108,  1619  (1951). 
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where  P  denotes  the  Cauchy  principal  value.  The  effective  range  approximation 
consists  in  neglecting  the  dependence  of  Re  g^  (co^)  on  co^  for  values  of  co^  <  ft>max> 
i.e.  we  can  write 

i^'-.[l-<o,.X(^,)]  (60) 

COq 

where  r\  (cOg)  is  almost  a  constant  for  small  cOg  and  is  given  by 


Ol  +  O^ 

cOgqv^(q) 


(61) 


where  the  cr's  are  the  total  cross-sections  for  the  states  corresponding  to  the 
suffixes.  This  linear  dependence  of  the  known  quantity  ?.^q^  cotdjwq  on  the 
meson  energy  Wq  has  been  verified  experimentally  for  the  (33)  phase  shift.  The 

intercept  at  zero  energy,  X^^  =  ("v)/^  gives  a  value  of /^  =  0.08. 

We  notice  that  the  (33)  state  effective  range  r\  is  positive  so  that  there  should 

be  a  resonance  at  an  energy  cOq  =  -j-  which  is  when  ^33  =  90°.  This  is  an  excellent 

agreement  with  the  well-known  resonance  in  scattering  at  200  MeV  incident 
energy  of  the  pion,  if  we  take  energy  cut-off  to  be  co^ax  ^  ^A*- 

The  above  effective  range  approximation  which  implies  that  an  expansion  of 
r^(co)  in  powers  of  (d  has  a  radius  of  convergence  o)  ^  co^ax  should  not  be  correct 
because  there  are  singularities  in  the  form  of  branch  points  at  z  ==  ±  1  •  But  actually 
the  contributions  from  the  non- analytic  parts  are  small  compared  to  the  remainder 
of  g^  (z)  which  has  no  other  singularity  in  the  low-energy  region. 

Finally  we  may  mention  that  independent  investigations  of  the  relativistic 
theory  of  meson  scattering  by  means  of  the  dispersion  relations  generally  uphold 
the  validity  of  the  above  results  in  the  low  energy  region. 

The  theory  of  photo-meson  production  ^ 

Chew's  cut-off  theory  has  been  successfully  applied  to  the  problem  of  pro- 
duction of  a  meson  of  momentum  q,  by  a  photon  of  momentum  k  impinging  on 
a  static  nucleon.  The  matrix  element  for  the  process  is 

Hu(q)  =  {wt^,  JAvd-Afy,)  (62) 

where  \  the  current  density  operator  of  the  matter  system  is  determined  by  the 
gauge-transformations  of  the  field  variables.^  This  leads  to  the  conventional 

^  G.  F.  Chew,  "The  theory  of  pion  scattering  and  photo  production",  Handbuch  der 
Physik  Springer- Verlag  loc.  cit.  G.  F.  Chew  and  F.  E.  Low,  Phys.  Rev.  101,  1579  (1956) 

2  This  means  that  we  have  to  replace  the  p  operator  wherever  it  occurs  in  the  Hamil- 
tonian  by  p  —  evl.  The  interaction  term  in  the  75  theory  (i.e.  with  pseudoscalar  coupling) 
does  not  obviously  contribute  to  the  current  density  operator  while  the  corresponding  term 
in  the  static  theory  does,  because  of  the  presence  of  the  gradient. 
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division  of  j  into  three  parts  corresponding  to  the  "meson  current",  "nucleon 
current"  and  the  "interaction  current".  But  in  Chew's  treatment,  as  we  want 
to  lay  emphasis  on  the  difference  between  ip^  the  single  physical  nucleon  state 
and  the  excited  states,  which  contain  one  or  more  real  mesons,  the  current  density 
operator  is  split  as 

i-lN  +  U'  (63) 

where  [^  is  defined  to  be  independent  of  the  meson  variables,  but  has  the  same 
matrix  elements  as  the  total  current  operator  between  the  single  physical  nucleon 
states  y)Q .  j^  is  defined  negatively  as  j  —  i^  and  has  the  property 

K,  L]  =  K.  i];  K,  LI  =  K  il-  (64) 

Thus  it  has  matrix  elements  connecting  ^q  to  only  states  containing  one  or  more 
real  mesons.  The  matrix  element  for  the  interaction  of  a  slow-moving  photon 
mth  a  fixed  nucleon  can  be  written  as 

[y)l  li'Adrip^)^{u^,  -/i.a-AxAu-)  (65) 

where  u^'^  are  "bare"  nucleon  wave  functions  and  /btr  the  static  magnetic  moment 


of  the  nucleon  which  can  be  written  as  jUj.  = 


~— +       -     T3 


— —  where 
2m 


jUp  =--  2.78  and  /^„  =  — 1.91  are  the  magnetic  moments  of  the  proton  and  neutron 
respectively.  For  large  photon  momentum  k  however  we  have  to  write 


Mk') 


2m 


l^l+f2L.  F^(k2)  +  Jj""     '"")  r,F,{k^) 


(66) 


where  Fg{k^)  and  I\{h^)  are  the  magnetic  moment  form  factors  similar  to  the 
ones  used  in  electron  scattering  theories.  We  have  a  further  division  of  \^  into 
an  isotopic  vector  part  jj^  involving  the  isotopic  spin  matrix  T3  and  an  isotopic 
scalar  part  jg .  The  matrix  element  for  the  former  is  given  by 

m (9)  =  (r^->,  -^,- ia-(kxX) F„ (h^) y>„) 

/,         4m  v{q')    \   r  J       ^ 

Here  X  is  the  polarization  vector  of  the  photon  and 

iMp-f^n)       % 


A  = 


2  2m 


where  v{q')  and  T^'  {q)  denote  the  same  quantities  as  in  the  section  on  scattering. 
This  simple  result  which  means  that  j^,  generates  neutral  mesons  of  momentum 
g'  =  (fe  X  A)  which  are  then  scattered  by  the  nucleon,  does  not  follow  from  the 
13* 
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general  theory  of  scattering,  but  is  a  chance  result.  The  matrix  element  due  to  j^ 
is  not  so  simply  related  to  the  scattering  amplitude.  It  is  given  by 

which  is  the  Low  equation  for  photo-meson  production.  This  part  turns  out  to 
be  small  compared  to  the  previous  one  as  it  has  a  factor  /Hp  +  jUn  ^  0.9  nuclear 
magnetons  whereas  jUp  —  jUn  =  4.7. 

Finally  the  operator  j^  gives  the  matrix  element 

mi^)^m,)-u\T'f':l  H-g^<$^|        (69) 

n    [  En-  o)g-ie  En  +  a)g       j 

where 


jj(n)  /    V  *  ^r  fr  l^Q  "^3  ~  "^3  "^ffJ 


^^     2(y.(g-fe)(g.A) 


{q  -  fe)2  +  1 


(70) 


[4r(Oqkfl^  2 

The  first  term  of  H^j^^  {q)  is  a  pure  ;S-wave  production  term  which  depends  only 
on  the  parameter  fj.,  a  result  known  as  the  KroU-Ruderman  theorem. ^  The  value 
for  p  =  /r/47r  thus  got  is  0.074  in  satisfactory  agreement  with  the  value  got  from 
scattering. 

Predominant  contributions  to  pion  photoproduction  comes  from  the  terms 
^k'^  (?)  ^^^  ^k^^  [q)-  ^^  ^?^  (q)  vanishes  for  the  case  ofn^  production  we  can  directly 
test  the  isotopic  vector  part  H^p{q)  which  makes  the  major  contribution  to  the 
neutral  P-wave  amplitude.  Using  the  measured  total  cross-section  of  n^  at  a 

photon  energy  of  325  MeV  where  there  is  resonance,  the  value  of  Cq  =  ^ — —  -j 

is  found  to  be  0.05  ±  0.003  which  compares  with  the  theoretical  value  Cq  =  0.06. 
Amplitudes  of  the  form  H^^^  {q)  lead  to  the  characteristic  (2  +  3  sin^^)  dependence 
of  the  differential  cross- section  on  the  angle  between  the  photon  direction  and  the 
direction  of  the  emitted  meson.  Further,  the  energy  dependence  for  the  total 
<3ross- section  is  correctly  reproduced. 

The  total  cross-section  for  charged  mesons  is  very  closely  the  contribution 
from  H^^^q)  which  is  P-wave,  and  the  first  term  of  H^^^q)  which  is  S-wave. 
Hence  if  we  substract  the  S-wave  part  from  the  measured  total  cross-section 
at  325  MeV  photon  laboratory  energy  we  may  test  the  P-wave  part  of  the  charged 
meson  production  predicted  by  H^^^  (q).  The  result  is  a  value  of  0.046  for  Cq  .  Further, 
the  interference  of  H^^^  {q)  and  the  S-wave  part  of  Hf^  [q)  should  produce  preferen- 
tially backward  positive  photo-pions,  and  such  an  asymmetry  is  actually  observed. 

The  problem  of  the  production  of  a  pair  of  mesons  by  a  photon  has  also  been 
studied  using  the  Chew-Low  method.^  A  good  agreement  is  obtained  with  the 
experimental  cross-sections  available. 

1  N.  M.  Kroll  and  M.  Ruderman,  Phys.  Rev.  93,  233  (1954). 

2  See,  e.g.  R.  E.  Cutkosky  and  F.  Zachariasen,  Phys.  Rev.  103,  1108  (1957).  S.  K. 
Srinivasan  and  K.  Venkatesan,  Niicl.  Phys.  12,  418  (1959). 
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Finally  we  may  mention  that  Chew's  method  has  been  applied  to  other  problems 
Uke  the  anomalous  magnetic  moment  of  the  nucleon,  the  theory  of  nuclear 
forces  and  the  scatteriug  of  iC-mesons.  Miyazawa's^  calculations  give  meson  con- 
tributions of  i  2  nuclear  magnetons  (approximately)  to  the  proton  and  neutron 
anamalous  magnetic  moments  respectively.  A  nuclear  potential  with  a  repulsive 
core  was  obtained  (Gartenhaus)  ^  by  using  this  theory.  The  width  of  the  core 
was  found  to  depend  on  the  energy  cut-off  and  the  coupling  constant.  This 
potential  has  been  used  recently  as  the  startiug  point  for  further  work  in  the 
problem  of  nuclear  forces.^  It  may  also  be  mentioned  that  the  use  of  Chew-Low 
ampUtudes  to  the  problem  of  scattering  and  photoproduction  of  pions  from 
deuterons*  yields  results  in  good  agreement  with  experiments  in  the  low  energy 
region. 


1  H.  MiYAZAWA,  Phys.  Rev.  104,  1741  (1956). 

2  S.  Gartenhaus,  Phys.  Rev.  100,  900  (1955). 

3  P.  S.  SiGNELL  and  R.  E.  Marshak,  Phys.  Rev.  109,  692  (1958). 

^  V.  Devanathan  and  G.  Ramachandran,  Nud.  Phys.  23,  312(1961).  A.  Ramakrishnan,. 
V.  Devanathan  and  G.  Ramachandran,  Nud.  Phys.  24,  163  (1961).  A.  Ramakrishnan, 
V.  Devanathan  and  K.  Venkatesan,  Nud.  Phys.  (in  press). 


CHAPTER  V 


NON-PERTURBATIVE  APPROACH 
TO  QUANTUM  FIELD  THEORY 


We  have  seen  in  an  earlier  chapter  that  in  quantum  electrodynamics  the  prescrip- 
tions for  mass  and  charge  renormalizations  are  sufficient  for  obtaining  finite 
results  for  all  observable  quantities  in  every  approximation  of  the  perturbation 
expansion  of  the  /S-matrix.  If  we  could  further  show  that  the  series  so  obtained 
is  convergent,  we  would  have  a  satisfactory  theory.  Investigations  of  certain 
simplified  models  of  a  quantized  field  theory  have  shown  however  that  for  these 
models  the  series  actually  diverges  even  after  renormalization  has  been  carried 
out  and  hence  we  can  expect  a  similar  situation  in  quantum  electrodynamics 
also.^  Hence  it  has  been  found  desirable  to  develop  a  procedure  for  renormalization 
of  mass  and  charge  without  recourse  to  perturbation  expansions.  Kallen^  has 
carried  out  such  an  investigation  and  found  that  if  quantum  electrodynamics 
is  a  consistent  theory  it  contains  at  least  one  infinite  renormalization  constant. 

The  situation  is  much  worse  in  meson  theory  where  the  meson-nucleon  coupling 
constant  g^  is  not  small  {g^  ^  15)  and  hence  a  power  series  expansion  in  terms 
of  it  is  bound  to  be  invalid. 

Lehmann,  Symanzik  and  Zimmermann^  made  a  non-perturbative  approach 
to  the  problem  of  strongly  coupled  fields  using  only  renormalized  quantities.  The 
main  merit  of  their  approach  is  that  they  do  not  make  use  of  any  specific  theory 
of  interaction  as  given  by  a  Hamiltonian  but  base  their  theory  on  certain  postulates 
which  are  of  quite  general  validity.  These  postulates  which  refer  to  the  field 
operators  are  the  following: 

(1)  The  field  operators  shall  have  correct  transformation  properties  under 
the  inhomogeneous  Lorentz  transformations,  both  proper  and  improper,  such  that 
the  theory  is  invariant  with  respect  to  such  transformations. 

(2)  They  obey  the  principle  of  microscopic  causality.  The  relativistic  statement 
of  causality  is  that  events  at  different  space-time  points  cannot  affect  each  other 
unless  they  can  be  connected  by  a  light  signal.  Events  at  points  having  a  space- 
like separation  cannot  affect  each  other.  Since  in  quantum  mechanics,  the  non- 

1  C.  A.  Hurst,  Proc.  Camh.  Phil.  Soc.  48,  625  (1952).  W.  Thirring,  Helv.  Phys.  Acta  26, 
33  (1953).  A.  Petermann,  Phys.  Rev.  89,  1160  (1953).  Arch.  Sci.  Phys.  Nat.  6,  5  (1953). 

2  G.  Kallen,  "Quantum Electrodynamics",  Handbuch  derPhysik,  Vol.  5,  Part  I,  Springer- 
Verlag  (1958). 

3  H.  Lehmann,  K.  Symanzik  and  W.  Zimmermann,  Nuovo  Cim.  1,  205  (1955).  See 
also  H.  Lehmann,  Nuovo  Cim.,  Supplement  series  X,  14,  No.  1,  153  (1959). 
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interference  between  the  simultaneous  measurement  of  two  dynamical  variables 
is  expressed  by  the  vanishing  of  the  commutator  between  them,  the  principle 
of  microscopic  causality  is  imbedded  in  the  statement  that  the  commutator 
(anti- commutator  in  the  case  of  fermions)i  of  two  Heisenberg  operators  taken 
at  two  space-time  points  should  vanish  for  space-like  separation  of  the  two 

^''"'*^'  ''^*  [^  W,  ^  (2/)]  =  0,  (x  -  i/)2  >  0.  (1) 

(3)  The  field  operators  satisfy  the  asymptotic  condition,  viz.  that  the  field 
operator  ^  (ic)  shall  go  over  for  0:4  ->  —  cxd  and  0^4  ^  +  00  into  the  free  field  operators 
Aii^{x)  and  ^out(^)  respectively  which  obey  the  free  field  equations  but  with  the 
physical  masses  of  the  particles  appearing  in  them.^ 

The  asymptotic  condition  restricts  the  theory  to  only  such  local  fields  which 
do  not  allow  bound  states.  Only  the  creation,  annihilation  or  the  scattering  of 
particles  are  envisaged.  To  formulate  the  asymptotic  condition  suitably,  we  expand 
the  operator  A(x)  using  a  discrete  set  of  wave  functions  f^{x) 

A{x)  =  2]{Ux)  A^{x)^f*{x)  A-^x)}  (2) 


where  the  /^  {x)  satisfy  the  free  field  equation 

(a-,^)M=0     and     _i||/|£_/*|Ljd3.=  l 
From  (2)  we  have  ^_ 

d^xA{x)-^^Ux) 


S^4 


/ 


d'x\A{x)-^fAx)-^A{x)Ux)\ 
[  cx^  ox^  J 


(33) 


(4) 


with  Ai^{x)  and  ^out(^)  defined  correspondingly.  The  asymjjtotic  condition  for 
the  field  operators  can  now  be  written  in  the  form 

Lim   <0  \A''(x,)\Wy  =  /0  A  °  „t.   W)  (5) 

for  arbitrary  0  and  W. 
Hence 

<0  \A{X)   \0C,^   >  =  <0  MinWl^in   >  =  fAx)  (6) 

and  o^*^  °"<^ 

<OM(a;)|0>==  <0  \A,Ax)\Oy  ^  0  (7) 


^  At  first  sight  this  looks  as  if  fermion  fields  were  acausal  but  the  apparent  contradiction 
is  removed  by  noting  that  the  fermion  field  operators  always  appear  in  bilinear  combinations, 
for  all  observables.  Two  such  combinations  taken  at  different  space-time  points  can  be  shown 
to  commute  with  each  other. 

2  See  chapter  III,  where  however  the  point  of  view  is  taken  that  the  Heisenberg  fields 
in  the  remote  past  and  future  coincide  with  the  in  and  out  fields  only  up  to  certain  constant 
multiplicative  factors  (the  renormalization  constants). 

3  In  this  chapter  we  again  revert  to  the  metrix,  dn  =  ^22  =  ^33  =  ~  ^44  =  1  to  be  in  con- 
formity with  the  literature  on  the  subject. 
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Using  the  asymptotic  condition,  the  scattering  matrix  element  can  be  reduced 
to  the  vacuum  expectation  value  of  products  of  Heisenberg  operators  and  this 
reduction  is  necessary  since  we  can  impose  the  causality  condition  only  on  the 
field  operators.  Consider  the  scattering  of  a  particle^  from  a  state  of  momentum  h 
to  a  state  of  momentum  k'  while  the  target,  changes  its  momentum  from  "p  to  p'. 
The  S  matrix  element  for  the  process  is  given  by 

iv'  K  \^\  Pkn>  =  <P'  Kut  I  P  ha>  =  <P'  |«1|  P  Ky 

=    Urn    <p'|^(fe',  x^)\ph,^y 

(8) 
8 


/• 


dx,—-ip'\A{k\x,)\vky-^ 


—  00 

+    hm    ip\A{k',  x^)\'ph,^y, 

a;4->-  00 

where  a/out\  (^')  is  the  annihilation  operator  for  an  outgoing  (incoming)  particle 

Vin  ) 
with  the  commutation  relations 

a/in  xW,  «/in  Ak')]=2k^d{k--k') 

.    Uut^  Uut>/  J 

[a/in   \W,  tt/in  x(^')l  =0 
L    \out>/  Uut>/  J 

and  . 

«inW=^inW;       a/nW  =  -4in(-A^)       for       k^>  0 .  (10) 

^""^  <P'  \ain{k') I  p h^y  =  <p'  fcfn  I  p h^y .  (11) 

The  first  term  becomes 


m 


(12) 


after  partial  integration  with  respect  to  the  space  coordinates.  The  matrix 
element  in  the  integrand  can  be  evaluated  by  again  applying  the  asymptotic 
condition 

<p'\A{x)\pk,,y  =  (p'\A{x)(4r.{k)\py 

==    hmJp'\A{x)AHk,y,)\py 

=  \m   (p'\T{A  (x)  AHk,  y,)}\py  (13) 

2/4->-oo 

d 


=  -jdy,  —  {p'\T{A{x)AHk,  y,)}\py-i- 
+    lim  <:p'\T{A{x)AHk,y,)\py 

J/4->  +  00 


1  Throughout  this  chapter,  we  assume  both  the  target  and  scattered  particles  are  scalar 
unless  otherwise  stated. 

2  Notice  the  difference  in  normalization  between  this  and  of  the  creation  and  annihilation 
operators  defined  in  chapter  III. 


ifd^'y 
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The  last  term  gives 

(p'  \aU{1c)  A{x)\py^2  k,  6{p'  -h)<0\A(x)\v).  (14) 

which  vanishes  when  (Da;  —  fi^)  operates  on  it.  The  other  term  leads  to 

ip  Knt  I  V  hu)  -  <P'  ^in  I  P  ^in> 

=  ^    .3  /  d^x d*y exp  —  [ik'x  —  iky]  (15) 

(D,-fi^)(n,-fi'')<,p'\T{A{x)A(y)}\py. 
To  express  the  scattering  amplitude  in  terms  of  the  retarded  product  defined  by 
R(A(x)A{y))^-id(x,-y,)[A{x),A(y)]  (16) 

where  d{x^  — 1/4)  i^  ^^®  Heaviside  function,  we  again  start  from  (13) 

ip'  \A{x)\p  k,^y  =  {p'\A  {X)  aL  (k)  I  p}  =  (p'  I  [A  (x) ,  al  (k)]  \  p}  + 
+  <:p'\alr.{Jc)A{x)\p}. 

The  last  term  gives  2k^d(p^  —  fe)<0  \A (x)  \  p'}  which  again  vanishes  on  applying 
{\Dx  ~  I^^)-  Hence  we  obtain 

<p'|[4(x),a,Ui)]|p>=   lim  <:p'\[A{x);A*{k,y,)]\py.  (18) 

There  is  no  boundary  term  for  1/4  ->  +  oo  since  the  retarded  product  vanishes  if 
y^>  x^.  Proceeding  as  before  we  get 

<v'Kni\vKy  =  <v'Tc^s\pk[,y 

=  I  d^a: d^y exp  —[ik'x~iky]x  (19) 

(2  7t)   J 

X  {B,-  f,^){[Jy^  f^^)<:p'  \B{A{x)  A{y))\py. 

We  can  continue  to  take  particles  out  of  the  state  vectors  and  finally  reduce 
the  scattering  matrix  to  the  vacuum  expectation  values  of  time-ordered  or 
retarded  products   of  Heisenberg   operators. 

Though  the  scattering  element  can  be  expressed  in  terms  of  either  time -ordered 
or  retarded  products,  it  turns  out  that  the  Feynman  amplitude  (namely,  that 
which  involves  the  T  products)  does  not  have  such  convenient  analjrtic  properties 
as  the  retarded  amplitude  (which  is  expressed  in  terms  of  the  retarded  products) 
since  the  latter  is  the  Fourier  transform  of  a  function  vanishing  outside  the  forward 
light  cone  which,  as  we  shall  see  presently,  enables  us  to  write  dispersion  relations 
between  the  real  (dispersive)  and  imaginary  (absorptive)  parts  of  the  scattering 
amplitude.  But  before  discussing  the  general  mathematical  and  physical  premises 
from  which  such  relations  can  be  derived  we  shall  outline  the  derivation  of  the 
dispersion  relations  for  the  forward  scattering  of  a  meson  by  a  nucleon,  due  to 
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Goldberger^  who  established  such  relations  as  algebraic  identities  between  the 
absorptive  and  dispersive  parts  of  the  retarded  amplitude,  the  division  being 
made  according  to  whether  real  or  virtual  intermediate  states  are  involved,  i.e. 
whether  or  not  conservation  laws  are  satisfied  in  the  intermediate  as  well  in  the 
initial  and  final  states. 

To  realise  that  there  should  exist  a  relationship  between  the  absorptive  and 
dispersive  parts,  we  can  adopt  the  following  general  argument.^  The  absorptive 
part  of  the  forward  scattering  amplitude  at  a  particular  energy  arises  through  the 
real  intermediate  states  and  can  be  found  directly  from  a  knowledge  of  the  total 
cross-section  for  incident  particles  of  that  energy  by  using  the  optical  theorem. 
On  the  other  hand,  the  dispersive  part  arises  from  virtual  intermediate  states  the 
energies  of  which  are  not  equal  to  the  initial  energy.  Such  intermediate  states  contri- 
bute to  the  absorptive  part  for  an  initial  state  with  the  corresponding  energy.  This 
energy  can  take  any  value  and  therefore  the  dispersive  part  should  be  an  integral, 
over  all  energies,  of  the  total  cross-section.  This  is  precisely  the  dispersion  relation. 

Consider  the  forward  scattering  of  a  boson  of  four  momentum  q  and  isospin 
index  p  which  is  scattered  into  a  state  with  four-momentum  k  and  isospin  index  oc 
while  the  target  nucleon  goes  from  the  state  [p,  X)  to  the  state  (p',  X')  where  p 
and  p'  are  the  four  momenta  and  X  and  X'  represent  the  spin  and  isospin  variables 
of  the  initial  and  final  nucleon  respectively.  The  retarded  amplitude  can  be 

written  as 

M,^{o),  X'  X)  =  D,p{m  X'  X)-\-i  £(co)  A,p{aj  X'  X)  (20) 

where 

D,f(u;  X  ;)  =  ~yd*xe«<»-.cos(fe.a;)<p'A'l£(a;,)[j,(a:)jp(0)]-        ^^^^ 

-2  6(x4)[L(x),  ?)^(0)]|pA> 
and  - 

^,^(o>;  A';i)  =  j/d%e^--*cos(fe.a?).<2>'A'|[u(a:),  j^(0)]|??A>  (22) 

where  e{x^  is  the  sign  function  which  is  connected  to  the  step  function  in 

ri  +  eix  )1 
the  retarded  product  by  0  {x^  =■ ^     .  The  fs  are  the  usual  meson  current 

operators  in  the  Heisenberg  representation.  Splitting  D  and  A  further  into  parts 
symmetric  and  anti- symmetric  under  interchange  of  a:  and  /5  (denoted  respectively 
by  the  superscripts  1  and  2)  and  introducing  a  complete  set  of  intermediate  states, 
it  has  been  demonstrated  by  Goldberger  that 

^  n  J  (CO  —  ojq)  [m  ^  —  CO") 

-  00 


'-  M.  L.  Goldberger,  Phys.  Rev.  99,  979  (1955). 
^  J.  S.  Toll,  Phys.  Rev.  104,  1760  (1956). 
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The  generalized  dispersion  relations  (appropriate  for  the  scattering  of  particles 
with  mass)  followed  from  this;  but  as  this  derivation  is  not  considered  rigorous, 
we  shall  proceed  to  a  study  of  dispersion  relations  from  a  more  deductive  point 
of  view  based  on  certain  general  principles  and  prescriptions. 

Single  variable  dispersion  relations 

Single  variable  dispersion  relations  connect  the  real  and  imaginary  parts  of  the 
scattering  amplitude  considered  as  a  function  of  a  complex  variable  which  can 
be  either  the  energy  or  the  momentum  transfer.  They  can  be  derived  from  the 
causality  condition  which  imposes  certain  restrictions  on  the  scattering  matrix 
elements  from  which  the  dispersions  follow  either  by  the  application  of  the 
Cauchy  integral  theorem  or  by  a  theorem  due  to  Titchmarsh.  Here  we  start 
from  Cauchy 's  theorem  which  states  that  an  analytic  function  f(z)  of  the  complex 
variable  z  can  be  represented  as  an  integral  over  a  closed  contour  C  enclosing  the 
region  R  within  which  f{z)  is  analytic. 

f{z') 


# 


ds' =  2  7ri/(2)  if  2  Hes  inside  C 

(24) 

=  0      ii  z  lies  outside  C . 
If  z  lies  on  the  contour,  f(z)  can  be  represented  by  the  principal  value  integral 

/(^)  =  ^-^(f  #-^d^'.  (25) 

711  J       Z    — Z 

If  z  lies  on  the  real  axis,  the  contour  can  be  chosen  to  be  the  real  axis  itself  and 
a  semicircle  with  infinite  radius  in  the  upper  half-plane.  If  the  contribution  to 
the  integral  from  this  semicircle  vanishes  as  the  radius  tends  to  oo,  we  obtain 

oo 

iiA=^{^AJ.  (26) 

I  71   J      Z    — Z 

-  00 

This  immediately  impUes,  on  taking  the  real  and  imaginary  parts  on  both  sides 

—  00 

which  is  exactly  the  dispersion  relations  if  /  (2)  represents  the  scattering  amplitude 
as  a  function  of  energy  or  momentum  transfer.  Thus  the  validity  of  this  relation 
depends  upon  the  analyticity  of  the  function  /  (2)  in  the  upper  half- plane  and  the 
vanishing  of  the  contribution  from  the  semicircle,  i.e.  we  have  to  carry  out  an 
investigation  of  the  analytic  properties  of  the  scattering  amplitudes  and  the 
possibility  of  an  analytic  continuation  of  these  into  the  upper  half- plane.  If  such 
a  continuation  is  possible  and  the  resulting  function  /  [E)  as  a  function  of  energy  E, 
falls  off"  not  slower  than  a  constant  ( 1^  I )"  ^  as  1^  I  ->  00  we  see  that  the  contribution 
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from  the  infinite  semicircle  over  the  real  axis  (taking  the  contour  to  be  the  real 
axis  and  the  infinite  semicircle  in  the  upper  half-plane)  will  be  zero  and  hence 

we  will  have 

+  00 

-cx> 
+  00  ^ 

Re/(^)=^/^^d£'  (27b) 

-00 

which  will  be  the  required  dispersion  relation.  But  we  cannot  always  expect  the 
scattering  amplitude  to  be  sufficiently  bounded.  If/  (E)  has  a  pole  of  w-th  order  at  00, 

tl"-  -7^^%^  -^  ^«  -  -'y*'«  f-«-  in  the  upper  half-p.ane  and  we 
get  instead 


[E-E^r^^  „    r  Imf{E')dE' 


n  J 


(E'  -  E)  (E'  -  E^)"*'^ 


(28) 


+  B^/(^o)  +  ---+^'^"'^'''f~-^''^''         (-oo<E,E,<oo) 


where  /^"^  denotes  the  n-th.  derivative  of  /(£^o)-  From  (28)  we  can  immediately 
write  down  the  dispersion  relations  for  any  process  provided  we  know  the  analytic 
behaviour  of  the   scattering   matrix   element   as   a   function   of  energy.   The 

f{E) 
analyticity  of  —^ — ^      , ,    at  00  introduces  a  pole  of  order  n^  1  at  j^n .  Using 

(•^  —  ^0) 
the  properties  of  f{E),  the  integration  from  —  00  to  00  can  be  replaced  by  one  from 
0  to  00  over  the  entire  range  of  energy  from  0  to  00.  Since  particles  of  finite  mass 
cannot  take  energies  less  than  the  rest  energy,  the  above  equation  when  applied 
to  particles  with  finite  mass  will  involve  integration  both  over  the  physical 
region  (where  the  energy  is  >  the  rest  energy)  and  the  unphysical  region  (where 
the  energy  E  lies  in  the  interval,  0  <  ^  <  rest  energy).  Of  course  we  are  con- 
fining our  statements  to  scattering  processes  and  the  situation  is  more  complicated 
when  we  discuss  processes  involving  more  particles.  As  an  example,  let  us  consider 
the  forward  scattering  of  light  taking  f(E)  to  be  the  refractive  index  of  light 
in  an  absorbing  medium.  Setting  in  (28)  n  =  0,  Eq=  0  (since  the  photon  has  no 
mass)  and  Im.  f(E')  =  — Ini/( — E')  we  get  the  Kramers-Kronig^  formula 

Re[»(«)  -  nm  =  1^  f-^S^d.'.  (29) 


1  H.A.Kramers,  Atti.  Cong.  Inter.  Fisic.  Comt.  2,  545  (1927).  R.  Kronig,  J.  Opt.  Soc. 
Amer.  12,  547  (1926). 
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As  another  example,  consider  the  case  of  the  forward  scattering  of  a  neutral 

pion  by  a  nucleon  for  which  Karplus  and  Ruderman^  assume  that  the  scattering 

amplitude,  f{E)  is  an  analytic  function  in  the  upper  half- plane  and  that  its  pole 

at  infinity  is  of  an  order  not  higher  than  the  first.  Since  the  neutral  meson  field 

is  real,  we  have  Im/( — E)  =  Im/(^).  Also  for  forward  scattering  we  can  use 

k 
the  optical  theorem  Im  /  (E)  =  — —  a  where  k  is  the  momentum  of  the  meson 

and  a  the  total  scattering  cross-section.  Hence  setting  n  =  I  in  (28),  changing 
over  to  momentum  as  the  variable  and  excluding  the  non-observable  region 
0  ^  E  ^  juwe  get  • 

00 

Re[/(|fc|)-/(|feo|)]^(^/<^(^')d^'x{^J^--^^-^j,         (30) 

0 

which  is  the  dispersion  relation  for  the  forward  scattering  of  a  neutral  pion  by  a 
nucleon. 

The  derivation  of  relations  of  the  type  (29)  and  (30)  depends  on  the  possibility 
of  continuing  the  scattering  matrix  element  f{E)  into  the  upper  half  plane  and 
this  is  related  to  the  principle  of  causality  which  therefore  forms  the  physical  basis 
of  dispersion  relations.  To  reaUse  this  let  us  suppose  that  f{E)  can  be  written  as 

00 

f(E)=  f  f(t)e*^'dt.  (31) 

—  00 

The  condition  of  causality  is  imposed  on  f{t)  by  specifying  that  /(^)  =  0  for  ^  <  0. 
Then 


^J^^y^'-ff^'^"'^^"'' 


(32) 


For  t  >  0,  the  convergence  of  the  integral  is  assured  since  when  we  pass  to  the 
upper  half-plane,  f(E)  will  acquire  the  factor  exp( — t  |Im^|).  For  t  <C  0,  the 
integral  vanishes  because  of  the  causality  condition.  If  on  the  other  hand  f{t) 
vanishes  only  for  ^  <  — a  where  a  is  some  "elementary  length"  then  by  replacing 
/  by  ^  —  a  in  the  integrand  oi  f(E)  we  shall  have 

f{E)  =  e-'^^f,(E),  (33) 

where  the  function  fi{E)  will  have  no  essential  singularity  at  infinity  while  the 
factor  e-'^"'^  will  have  such  a  singularity  when  E  ->  oo.  Hence  if  we  wish  to  write 
a  dispersion  relation  for  this  case  we  must  multiply  /  (E)  by  exp  (iocE)  where  oc  ^  a. 
Thus  an  experimental  observation  of  a  deviation  from  the  usual  type  of  dis- 
persion relation  will  provide  evidence  for  the  necessity  of  a  non-local  theory. 


1  R.  Karplus  and  M.  A.  Ruderman,  Phys,  Rev.  98,  771  (1955). 
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Goldberger's  relations  discussed  earlier  were  found  to  hold  in  the  physical 
region  of  energies  but  his  expressions  diverge  for  E  <C  fJt-  Oehme^  returned  to 
the  study  of  the  analytical  properties  of  the  scattering  amplitude  and  succeeded 
in  carrying  out  the  analytic  continuation  of  the  amplitude  into  the  upper  half- 
plane  by  assuming  its  quadratic  integrability.  This  however  is  too  restrictive 
an  assumption  for  physical  amplitudes  in  which  improper  functions  such  as  the 
^-function  can  possibly  occur.  Recently  however  rigorous  proofs  of  dispersion 
relations  have  been  given  for  the  case  of  the  forward  scattering  of  a  boson  with 
mass  by  Symanzik^  and  for  the  scattering  of  such  a  particle  at  any  angle  by 
Bogoliubov^  etal.  and  Bremmerman  et  al.*  who  use  theorems  in  complex  variable 
theory  for  their  proofs.  The  difficulty  involved  in  the  latter  approach  can  be  seen 
from  the  following. 

According  to  a  theorem  due  to  Wightman,  Hall  and  Bargmann^  if  a  function 

fiqi,  .  .  . ,  qn)  of  n-fonr  vectors  q^,  .  .  . ,  g„  is  an  analytic  function  of  the  variables  q^ 

and   is  invariant   under  both  proper  and  improper  Lorentz   transformations, 

then  fiqi,  .  .  . ,  q,i)  is  a  function  of  the  scalar  products  qj ,  qj^  and  is  analytic  in  the 

n{n  —  1) 
corresponding  domain  of  the  variables  qj  •  qj^  which  will  be in  number. 

For  the  case  of  scattering  of  one  particle  by  another,  there  will  be  four  vectors 
and  six  scalar  products  involved.  Functions  of  single  complex  variable  are  simple 
in  that  there  exists  for  every  domain  D  a  function  f(z)  which  is  analytic  only  in 
that  domain  and  hence  cannot  be  continued  outside  D.  However  for  functions 
of  more  than  one  complex  variable  any  arbitrary  domain  will  not  in  general  be 
the  domain  of  analyticity  for  an  analytical  function  and  there  exists  the  possi- 
bility that  all  functions  of  many  complex  variables  which  are  analjrbic  in  D  can  be 
analjrtically  continued  simultaneously  into  larger  domains.  The  largest  such 
domain  in  which  the  class  of  functions  are  analytic  is  called  the  envelope  of  holo- 
morphy  E{D)  of  the  domain  D.  The  problem  of  obtaining  this  is  very  difficult. 
Bremmermann  et  al.  however  succeeded  in  computing  the  holomorphy  for  a 
suitable  sub-domain  of  the  six  complex  variables  involved  in  scattering  and  were 
also  able  to  demonstrate  enough  analytic  properties  of  the  scattering  amplitude 
to  be  able  to  prove  the  dispersion  relations  for  n  —  N  scattering  and  for  several 
other  cases  with  restrictions  on  the  momentum  transfer  or  on  the  ratio  of  the 
masses  of  the  scattering  particle  and  target. 

There  is  however  another  approach  to  the  problem — that  of  obtaining  an  inte- 
gral representation  for  the  scattering  amplitudes  for  real  values  of  the  arguments. 


1  R.  Oehme,  Phys.  Rev.  97,  1392  (1955). 

2  K.  Symanzik,  Phys.  Rev.  105,  743  (1957). 

^  See  Introduction  to  Theory  of  Quantized  Fields  by  N.  N.  Bogoliubov  and  D.  V.  Shirkov, 
Interscience,  New  York  (1959).  Bogoliubov  avoids  the  explicit  use  of  the  theory  of  several 
complex  variables  by  employing  parametrizations  and  using  distribution  methods. 

*   Bremmermann,  Oehme  and  Taylor,  Phys.  Rev.  109,  2178  (1958). 

^  Wightman,  Hall  and  Bargmann,  K.  Danske.,  Vidensk.  Mat.  Fys.  Medd.  31,  No.  5, 
(1957). 
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The  advantage  of  this  approach  is  that  the  analytic  continuation  to  complex 
values  becomes  trivial  and  the  analytic  properties  of  the  scattering  amplitude 
are  readily  apparent.  This  approach  which  would  thus  be  the  converse  of  the 
previous  one,  was  used  by  Dyson ^  who  solved  the  problem  of  constructing  a 
general  integral  representation  for  a  scattering  matrix  element  and  by  Lehmann^ 
who  used  the  Dyson  representation  for  studying  the  analytic  properties  of  the 
scattering  matrix  element.  We  shall  summarize  the  important  steps  and  the 
results  obtained. 

The  main  objects  of  study  are  the  vacuum  expectation  values  of  products 
of  field  operators  whose  structures  are  to  be  determined  from  the  following 
general  principles : 

(1)  Transformation  properties  of  the  operators  under  Lorentz  transformations, 

(2)  The  causality  condition, 

(3)  The  asymptotic  condition, 

(4)  The  absence  of  negative  energy  states, 

(5)  The  spectral  condition  which  states  that  all  the  physical  states  of  the 
universe  form  a  complete  set  of  basic  vectors  in  the  quantum  mechanical  sense, 
each  state  being  associated  with  a  total  energy-momentum  vector  which  is  positive 
time -like. 

The  integral  representation  for  the  two-fold  vacuum  expectation  values  of 
operators  is  easily  derived,  e.g. 

^(p{x)(p{x')y,==A^^'^  (x-x^)  (34) 

can  be  written  as 

Zl(+)'(a:  -  x')  -  -—"-^3  fd{k^)  P{k^)  e-'*(^-^)d^A:.  (35) 

The  prime  on  A  denotes  that  the  operators  are  in  the  Heisenberg  representation. 

Here  ^^^2)  ^  (27r)^  v<0  W{0)  \  ny^ii  \(p{0)  \  0>.  (36) 

n 

On  using  the  integral  representation  for  the  free  field  function,  A^^'^{x  —  x') 
we  obtain  the  integral  representation 

00 

A<<^y {x  -  x')  =  j P{k'')  A<^\x  -  x' ,  k^)dk^.  (37) 

0 

The  problem  of  finding  an  integral  representation  for  the  vacuum  expectation 
value  of  products  of  three  or  more  operators  is  difficult  and  has  not  been  solved 
completely.  Dyson  however  succeeded  in  solving  the  problem  of  an  integral 
representation  for  the  function 

fix)  =^  <Q\[A  (x/2),  B{~xl2)]  I  P> 

1  F.  J.  Dyson,  Phys.  Rev.  110,  1460  (1958).  See  also  V.  Ya.  Fainberg,  JETP  36  (9), 
1066  (1959). 

2  H.  Lehmann,  Nuovo  Cim.  10,  578  (1958). 
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which  is  immediately  comiected  with  the  matrix  element  of  a  scattering  process. 
He  showed  that  if  the  function  /  (x)  has  the  following  properties : 

f(x)  =  0    a:^  >  0  (causality  condition) 


or 


f{q)  =  f  e'ff^  /(a;)  d^a;  =  0  unless 


(38) 


spectral  condition     (39) 


2  ^/       -2- -a'  2 

where  m^  and  m^  are  the  smallest  masses  of  the  intermediate  states  for  which 


and 


<^|^(0)|Zi>H=0;     <l,\B{0)\Py^O 
<Q\B{0)\ky^O;     <k\A{0)\Py^O 
respectively,  then  f{q)  can  be  represented  by 

00 

fiq)  --  j  dx"  j  ^'u0(x\  n)  8{q  -  u)  d[{q  -  u)^  -  x^] 

0 

if  and  only  if  cp  {y^,  u)  vanishes  everywhere  except  for  the  region 


(40) 
(41) 

(42) 


X  ^  max  jo,  m^  — 


4-M 


\l±Q 

mo  —    — u 


(^±.)'«. 


For  the  retarded  product,  this  leads  to  the  representation 


/-(*)--i-/^^»/'^^"-te 


nY  -  x" 


(43) 


(44) 


Lehmann  used  the  above  integral  representation  for  deriving  the  region  of 
-analjrticity  of  the  scattering  amplitude  and  the  restrictions  on  the  momentum 
transfer  for  proving  the  dispersion  relations  for  the  scattering  process 

p^  k^f'  -^  k',  (45) 

the  scattering  amplitude  for  which  is  given  by 

i(k  +  k')'X 


T==-  fd^x 


exp 


where 


^■Wi)-(-f)l 


R^{A  [x/,)  A  (-xl2)}  =-  e  {X,)  [i  (a:/2)  j  (-a:/2)] . 


(46) 


(47) 


This  is  defined  not  only  on  the  energy  shell  k^  =  k'^  =  ju^  but  for  arbitrary  real 
vectors  k,  k'.  The  variables  we  can  choose  are 


W^={p  +  kf; 


-{k-  k'f 


^  =  jc^  =  jc'^, 


(48) 
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where  the  first  two  correspond  to  the  square  of  the  total  energy  and  the  negative 
of  the  square  of  the  momentum  transfer  in  the  centre  of  the  mass  sj^stem 
respectively.  Instead  of  W^,  we  can  also  use  the  variable 

.  =  Jl±pMlL,  (49) 

which  is  connected  to  W^  by 


Tf2_2wl/zl2  +  m2  +  2zl2  +  m2  +  A;2.  (50) 

Consider  the  expression 

i{k  +  k')x 


M  =     d^^rexp 


<?>'lir/2)i(-72)l2»  (51) 


2 

=^{2  7t)'<p'\m\v  +  ky(^p  +  k\m\py, 

which  is  related  to  the  imaginary  part  of  the  scattering  amplitude  by 

ImT{a),  A\^)-=i{M{co,  A\  $)-M{-a},  A^,  ^)},  (52) 

where  M( — (jo,A^,^)  can  be  obtained  from  M{o},A^,i)  by  the  transformation 

k->—k',    k'->—k,    p^p\    p'->p.  (53) 

(Such  a  relation  between  M{+co)  and  M[ — co)  is  essential  for  converting  the 
integration  over  the  negative  energies  into  one  over  the  positive  energies  only.) 
In  the  Breit  frame  for  the  target  particles  in  which  p  -{-  p'  =  0  the  T-matrix 
can  be  written  as 

T{(jo,A^,  I)  =  -fd^x  ex]^[imx^-ie'X'  ]/co^  -  A^  -  ^] 

<-p\R'{A{%)An,)}\py,  (54) 

where  e  is  the  unit  vector  perpendicular  to  p.  All  the  co  dependence  of  T  lies  in 
the  exponential.  Set 

$  =  ii  +  ii2>     0}  =  oj^  +  ico^. 

Then  (54)  defines  an  analytic  function  of  the  complex  variables  co  and  |  regular  in 

o>2>0,         (ol>i,-{-A^',         Imco>Iml/co2_^2_|  (55) 

or     2co.2{co,-]/a)l-  i^-  A^)<^^<2co^{a),^]/col-  i,-  A^).  (56) 

If  we  chose  I  =  ^1  <  — A ^  then  T  is  regular  in  Im  co  >  0  and  if  T  is  sufficiently 
bounded  we  have  the  relation 

00 

rpt       A2    t^        ^      Ca      Imy(co^zl^  ^1) 

T{cjo,  A^,  ij)^ —       dco r 

71     J  CO^  —  CO 

-  00 
00 

=  —    r  dco' if  (o)',  A^  li)  \-^ 4-  -TT— }•  (^'^) 

jc    J  I  CO  —  CO         co'  +  0)] 

-00 

(A  term  is  is  understood  in  the  denominator.) 

EPCR      14 
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Lehmann  next  shows  that  this  relation  can  be  continued  analytically  to  the 
mass  shell  |  =  ^u^.  Further  from  (51)  and  the  corresponding  definition  of 
M{—  W,  Z|2, 1)  one  notes  that  M(W,A^,^)  has  a  pole  for  W^  =  {p  +  kf  =  m^ 
and  is  zero  unless  [f  +  kY  ^  (w  +  [j)^-  M{ —  W,  A^,  i)  has  a  pole  for  w^  =  mr 
=  {p  —  kY  and  is  zero  unless  W^  >  (m  +  jn)^.  The  contributions  from  these  can 
be  shown  to  be  given  by 

M(co,  A^  I  -  gm)d(W^'  -  m2)  (58) 

M(-co,  A\  ^)  ==  gr2(|)  d(W^  -  m2),  (59) 

where  g^  is  apart  from  constant  factors  the  conventional  coupling  constant. 
Changing  back  from  w  to  W^  as  the  energy  variable,  we  get  finally  the  dispersion 
relation  for  the  scattering  process 

1  1 


ReT(PF2,  zJ2)==^ 


+  4~      f  dW'^M{W'^  A\  ^) 

271        J 


1 


(60) 


+ 


W'^-W^       W'^-4:A^-2{m^-^jLi^)-^rW^\ 

To  obtain  the  region  of  validity  of  the  dispersion  relation  we  have  to  study 
the  analyticity  properties  of  M.  Using  Dyson's  integral  representation  for  the 
matrix  elements  of  the  retarded  products  appearing  in  M  it  can  be  proved  that 
for  the  validity  of  the  dispersion  relation,  the  momentum  transfer  must  be 
restricted  by  the  inequality 

W"  —  (mj  —  m.2r 
where  K  is  the  particle  momentum  in  the  centre  of  the  mass  system  and  is  given  by 

For  the  case  of  pion-nucleon  scattering  it  can  be  shown  ^  that 

m^  =  3/^ ,  mg  =  m  +  /^    and  we  have     W  ^  m  +  fi.  (63) 


Hence 


8        2m  +  /,i 

A^^—^a"^- ^,  (64) 

3        2m  —  ju 


which  means  that  the  dispersion  relations  for  the  scattering  amplitude  as  a  function 
of  energy  can  be  proved  rigorously  for  fixed  momentum  transfer. 

For  nucleon-nucleon  scattering,  however,  for  which  /j,  =  m  =  nucleon  mass, 
m^  =  m  -\-  m^ ,  m2  =  m^  and  PF  ^  2/^  we  have 

A^<AL.^ml-»i^+  '^"  +  "'">\  (65) 


^  See  e.g.  N.  N.  Bogoliubov  and  D.  V.  Shirkov,  Introduction  to  the  Theory  of  Quantized 
Fields.  Interscience  Publishers,  New  York  (1959)  chap.  IX. 
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which  is  positive  only  if 

m„^(]/2  — l)m.  (66) 

Thus  even  for  forward  scattering,  a  dispersion  relation  cannot  be  proveci  in  this 
case. 

Dispersion  relations  for  other  quantities  besides  scattering  amplitudes  have 
been  studied.  The  following  table  due  to  Goldberger^  summarizes  the  present 
situation  regarding  the  dispersion  relations  for  various  processes  and  vertex 
functions. 

(a)  Proved  relations 


Process 

Limitation  on  invariant 
momentum  transfer 

Continuation  of  the 

imaginary  part  into 

the  unphysical  region 

by  convergent  partial 

wave  expansion 

n-rN-^ji  +  N 

O^zl^^zlL, 

7i-\-  n^n  +  71 

4Lx  =  V 

o^zi-^<ziLx 

• 

J.            o|(2m+//)^ 

2m  +  /.] 
m       J 

o^^^<4Lx 

V  +  iV^TT  +  iV 

e^N  ^e-^7i  +  N 

^Lx-3/.^ 
^(-9/^2)  _  6^2 

zif,^J^<ziLx 

'*       m+fi,          4 

(b)  Some  unproved  relations 

Process 

Mass  restrictions;  A^  =  0 

Perturbation  theory  ^ 
(every  finite  order) 

,V  +  iV  -^  iV  4-  iV 

ju>{f2-l)m 

proved  for  Zl^  <  f^'/^ 

K  +  N-yK  +  N 

Comphcated;  not  ful- 
filled by  narrow  margin^ 

n  +  D  ^  TT  +  D 

e>Y'  Mj,  =  2m-e 

1  M,  L.  GoLDBERGER,  International  Conference  on  High  Energy  Physics^  CERN  (1958), 
Ed.  B.  Feretti,  pp.  208,  209. 

^  It  may  appear  odd  that  one  should  try  to  prove  dispersion  relations  in  perturbation 
theory  when  the  aim  of  the  approach  is  to  get  away  from  it,  but  it  is  expected  that  a  study 
of  the  analyticity  properties  of  the  individual  terms  of  the  perturbation  series  may  serve  as 
a  guide  for  the  more  complicated  situation. 

^  Numerical  calculation  gives  the  condition  M  ^  8.573,  which  again  renders  a  proof 
impossible  in  this  case. 

14* 
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(c)   Vertex  functions 


^     Vertex 

Mass 
restrictions 

Remarks 

Perturbation  theory 
(finite  order) 

<N\{C}^f^^)q>„\Ny 

yu>  (|/2-  l)m 

Representations 

do  not  follow 

from  causality 

and  spectrum 

alone 

Proved 
Proved 

<:r|(n-f-/i*)9)„|^> 

Proved 
Proved 

Lehmann  has  also  proved  that  the  real  and  imaginary  parts  of  the  scattering 
amplitude  as  a  function  of  the  momentum  transfer  (or  equivalently  the  angle 
of  scattering  in  the  c.m.s.)  are  analytiq  within  certain  ellipses  which  include  the 
physical  region.  The  region  of  analjrticity  for  the  imaginary  part  is  greater  than 
that  for  the  real  part  and  is  given  by  an  ellipse  with  foci  at  ±  1  and  with  semi- 
major  axis 


Zr 


2  {ml  —  m^)  (m|  —  ju} 


K^iW^-im^-m^f] 


(67) 


This  is  a  particular  case  of  a  more  general  representation  due  to  Mandelstam 
(see  next  section).  We  conclude  this  section  with  some  applications  of  the  dis- 
persion relations  with  energy  as  the  variable. 

(1)  A  truncation  of  the  dispersion  relations  for  meson-nucleon  interactions 
(scattering  and  photoproduction  of  mesons)  on  the  basis  of  the  dominance  of 
the  (3/2,  3/2)  resonance,  whereby  high  energy  phenomena  are  excluded  from 
consideration,  leads  to  approximations  which  can  be  considered  a  relativistic 
generalization  of  Chew's  static  model  but  which  are  superior  in  that  they 
satisfy  Lorentz  and  gauge  invariance  and  also  take  recoil  corrections  into 
account. 

(2)  The  use  of  phase-shift  analysis  for  the  reduction  of  experimental  data  on 
meson-nucleon  scattering  does  not  lead  to  a  unique  determination  of  the  phase 
angles  but  yields  a  number  of  possible  sets  of  these  angles.  The  dispersion  relations 
give  a  number  of  additional  equations  which  enable  us  to  choose  the  correct 
system  of  phase  angles  as  mentioned  earlier. 

(3)  The  dispersion  relations  provide  a  possibility  for  an  experimental  test  of 
the  existence  of  an  elementary  length. 

(4)  The  dispersion  relations  lead  to  a  precise  definition  of  the  pion-nucleon 
coupling  constant  which  appears  as  the  residue  of  poles  in  the  pion-nucleon  and 
nucleon-nucleon  scattering  amplitudes  (see  the  next  section). 
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The  Mandelstam  representation^ 

It  was  mentioned  in  the  last  section  that  proofs  for  dispersion  relations  for 
scattering  processes  can  be  given  only  for  restricted  values  of  the  momentum 
transfer  and  the  imaginary  part  of  the  scattering  amplitude  is  analytic  within 
a  certain  ellipse  in  the  complex  momentum  transfer  plane.  But  this  latter  region 
does  not  extend  much  further  than  the  limitation  already  present  in  the  derivations 
of  the  dispersion  relations.  Mandelstam  has  made  an  intuitively  appealing  con- 
jecture that  the  scattering  amplitude  is  an  analytic  function  of  both  energy  and 
momentum  transfer  variables  with  singularities  only  on  the  real  axis  of  these  two 
variables.  This  leads  to  a  simple  integral  representation  of  the  scattering  amplitude. 
The  representation  has  been  verified  up  to  fourth  order  in  perturbation  theory.^ 

Let  us  start  by  noting  the  important  consequences  of  the  "substitution  law" 
of  perturbation  theory.  Consider  the  following  processes  in  which  pions  and 
nucleons  can  take  part : 


(1) 

(2) 
(3) 
(4) 
(5) 


7T  +  TC^-^Tl  +  71 
71  +  Tl^^N  +  N 
7Z+N^->7C-rNoY7l+    N  ^^71 

N  +  N<^N  +  N 
N  +  N^N  +  N 


N 


(68) 


These  can  be  represented  diagrammatically  by  Figures  29(a),  (b)  and  (c),  if  we 
observe  the  convention  that  all  incoming  particles  have  a  positive  time-like  four- 


(a) 


(b) 


Fig.  29.  Graphs  for  the  pion-nucleon  system. 


1  S.  Mandelstam,  Phys.  Rev.  112,  1344  (1958);  115,  1741,  1752  (1959).  T.  W.  B.  Kibble, 
Phys.  Rev.  117,  1159  (1960);  see  also  G.  F.  Chew,  Ann.  Rev.  Nucl.  Sci.  9,  29  (1959). 

2  Eden  has  given  a  proof  of  the  Mandelstam  representation  valid  for  all  orders  of  per- 
turbation theory:  R.  J.  Eden,  Phys.  Rev.  Lett.  5,  213  (1960);  see  also  R.  J.  Eden  et  al.,  Phys. 
Rev.  122,  307  (1961). 
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momentum  and  that  going  opposite  to  the  direction  of  an  arrow  means  changing 
particle  to  antiparticle.  Figure  29(b)  includes  both  the  processes  (2)  and  (3)  while 
29(c)  includs  processes  (4)  and  (5)  so  that  processes  (1)  to  (5)  are  really  different 
manifestations  of  three  fundamental  matrix  elements  represented  by  Figures  29  (a), 
(b)  and  (c). 

For  two  incoming  and  two  outgoing  particles,  we  can  construct  two  invariants 
corresponding  to  the  total  energy  and  angle  of  scattering  in  the  c.m.s.  Define 
three  invariants  ,  vo       /  x> 

«1  =    iPl  +  Pi)"  =    iP2  +   Ps)' 


with 


^2  =   (P2  +  PiY  =--   [Pl  +  Pif 

«3  =   {Pz  +  Pif  =   iPl  +  P2)' 
4 
5l  +  <52  +  ^3  ^    2j^i-=K 


(69) 


Hence  only  two  of  the  three  s  variables  are  independent.  Each  is  the  square  of 

the  total  energy  W^.  in  the  c.m.s.  for  a  particular  pairing  or  "channel"  of  incoming 

and  outgoing  particles.   (For  each  diagram  there  will  be  three  channels.)  For 

instance  if  p^  and  ^2  correspond  to  the  incoming  particles  and  — p^  and  — p^ 

to  the  outgoing  particles  (process  I)  then  the  total  energy  is  ^s^;  Sg  will  be  the 

negative  of  the  square  of  the  momentum  transfer,  A^,  while  s,  in  general  may 

involve  both  W^  and  A^.  For  processes  II  and  III  in  which  the  incoming  particles 

are  1  and  4  and  1  and  3  respectively  the  total  energies  will  be  j/^^  and  j/^g. 

The  condition  for  a  physical  process  to  occur  may  be  expressed  in  terms  of 

^1,  ^2  ^nd  S3.  In  order  that  one  of  the  physical  processes  (I,  II  or  III)  may  be 

energetically  possible  for  a  given  set  of  values  of  the  scalar  products  PiPj,  we 

must  have  ,  -^  /nn   \ 

±PiPj>mimj.  (70a) 

The  positive  sign  is  to  be  taken  if  the  particles  i  and  j  are  both  incoming  or  both 
outgoing  and  the  negative  sign  if  it  is  otherwise.  Further  the  requirement  that 
the  scattering  angle  be  real  can  be  stated  in  the  form^ 


0 


Pl  Pl  'P2     Pl'  P^ 

P2 '  Pl     Pl  P2 '  Pz 

Pz-Pl       P3'P2       Pl 

In  terms  of  the  variables  s^,  s^  and  53,  (70a)  and  (70b)  can  be  written  as 

S3  >  (m^  +  m^)^  or  s^  <  {m^  —  mg)^  etc. 

and  s^S^S^  >  (Si  +  ^2  +  ^3)^(««3  +  ^-^2  +  csi) 

respectively.  In  (72)  the  dimensionless  constants  a,  b,  c  are  given  by 
K^  a  =  (mf  m|  —  w|  m|)  (m|  -f-  w|  —  ml  —  ml) 
K^h  =  (mj  ml  —  m|  mj)  (mf  +  m^  —  m|  —  m|) 
K^c  =^  (mf  m|  —  ml  m|)  (mf  +  w|  —  m^  —  m|) 


(70b) 


(71) 
(72) 


(73) 


1)  T.  Kibble,  Ph?/s.  Rev.  117,  1159  (1960). 
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The  variables  Sj ,  s^  and  s^  may  now  be  considered  as  homogeneous  coordinates 
in  a  plane  in  which  the  line  at  infinity  is  5^  +  ^g  +  -^a  =  0.  The  region  (72)  is 
bounded  by  a  cubic  curve  in  this  plane,  the  asymptotes  of  which  are  5^  =-  0,  Sg  ==  ^ 
and  ^3  =  0.  The  curve  intersects  its  asymptotes  on  the  line  as^  +  bs^  +  68^  =  0. 
The  bounding  lines  of  the  region  defined  by  (71)  are  tangential  to  the  curve  the 
shape   of  which   is   given   in  Fig.  30.   If  we  assume  that  m^^m^^m.^  m. 
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Fig.  30.  The  physical  regions. 


(which  is  satisfied  in  practice)  we  have  a'^h^  c.a  and  h  are  positive  while  c  may 
have  both  signs.  The  regions  I,  II  and  III  are  the  physical  regions  for  the 
three  scattering  processes  (the  region  IV,  corresponding  to  the  decay  process 
1  ->  2  +  3  +  4  can  occur  only  if  m^  >  mg  +  mg  +  ^4 .  We  shall  not  be  considering 
it  here).  We  note  that  the  entire  boundary  of  each  of  the  regions  I,  II  and  III  is 
a  part  of  the  cubic  curve. 

Now  we  can  conjecture  from  the  substitution  rule  as  follows :  A  single  analytic 
function  describes  all  three  channels  contained  in  the  same  diagram  (Fig.  29).  The 
physical  amplitude  for  the  process  when  1  and  2  are  the  incoming  particles  is  the 
boundary  value  of  an  analytic  function,  as  the  variable  s^  approaches  the  positive 


216  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

real  axis  from  above  in  its  physical  energy  range  with  one  of  the  other  two  variables 
held  fixed  at  a  physical  value  in  the  momentum  transfer  range.  The  ampUtude 
for  the  cases  when  1  and  3  and  1  and  4  are  considered  as  incoming  particles 
are  got  from  corresponding  limits  on  the  same  function  taken  with  the  variables 
^2  or  s^  respectively. 

To  derive  the  Mandelstam  representation  from  this  fundamental  assumption 
we  can  restate  the  latter  in  the  form ;  There  exists  a  function  A  (z^ ,  22 »  ^s)  o^  *^^ 
three  complex  variables  (only  two  of  which  are  independent),  which  is  analytic 
except  in  the  following  regions. 

(i)  Zj  =  <%!     or     Zj  (real)  >  a^     for  all     z^ 

(ii)  ^2  "^  ^2     or     Z2  (real)  >  a^     for  all     23  (74) 

(iii)  23=^3     or     23  (real)  >a3     for  all     z-^. 

All  the  as  and  a's  are  real  and  positive  and  correspond  respectively  to  poles  and 
branch  points  for  the  three  interactions.  Thus  in  the  case  of  n-N  scattering,  for 
instance,  we  have 

oci=(x^  =  M^;     «!  =  ftg  =  (if  +  ju)^;     ttg  ^  ^A*^ 

but  there  is  no  isolated  pole  in  Zg . 

To  obtain  the  Mandelstam  representation,  we  have  to  assume  further  that 

«!  +  0^2  +  <*3  >  ^>  (75) 

which  is  evidently  fulfilled  in  the  n-N  case.  We  will  write  down  the  dispersion 
relations  for  the  function  A  {z^ ,  z^)  assuming  that  it  is  well-behaved  at  infinity. 
Let  us  fix  the  variable  Z2  (we  assume  for  definiteness  that  Im  (22)  >  0)  and  con- 
sider A  (Zi ,  22)  as  a  function  of  z^  alone.  An  application  of  the  Cauchy  theorem 
gives  the  dispersion  relation. 


Aiz„,,)  =  -^l^+  '^^(^^) 


00 

A^  [XQy  Z2)  dx^ 


71  J         X{  —  Z^  7T  J 


{K-Z^-X'^) 


(76) 


(The  cuts  will  not  meet  each  other  since  Im  22  4=  0).  The  functions  A^  and  A^ 
are  the  jumps  across  the  cuts  of  the  function  A.  For  example 

A  (^i»  22)  =  -^  lim  [A  {x^  +  ^e,  z^)  -  A{x^-iE,  z^)'] .  (77) 

^*  e->o 

h^  and  h^  are  the  residues  of  the  function  A  at  the  poles  oc-^  and  ^  —  23  —  ^3 . 
To    simphfy    the   notation  we   can  include  the   contributions  from  the  poles 
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into  the  integrals.  For  this  purpose  we  replace  the  spectral  functions  A^  and 
^■laby 

Ai  (xi,  22)  ^  <5  (xi  -  «,)  /ii  (Z2)  +  Ai  {x{,  Zj)        1 

A  (4»  22)  ->  -  ^  (^3  -  ^3)  h  (^2)  +  ^3  (4»  ^2)  I 


so  that  we  have 


^3  (^3  >  ^2)        ^^, 

(iC  -  ^2  -  a;^)  -  2l  ^' 


(79) 


For  real  values  of  z-^  appearing  in  (77)  both  cuts  in  the  z^  plane  are  along  the 
real  axis.  The  condition  (75)  guarantees  that  the  two  cuts  in  the  Sg  plane  do  not 
meet  (as  Xj  >  aj).  Using  (77)  we  obtain 


n  J        x^  —  22 

oo 
71     J     {K-X^-X^)-Z2  ^ 


(80) 


where  ^23  ^^^  ^12  ^^®  again  jumps  (barring  the  residues  at  the  poles)  across  the 
cuts. 

^23  (^1 J  ^2)  =  IT^  li«i  [^1  (^1 »  ^2  +  *'£)  -  ^1  (^1 ,  0^2  -  ^  e)] .  (81) 

^*  e->o 

From  (77)  and  (81)  we  have,  for  x-^y  a^,  x<^y  a^ 


^23  (^1  >  ^2)  =  —  -r    li"^    [^  (^1  +  *^»  x^  +  iri) 

4   e,  »7-).0 

+  ^  (o^i  —  ^£,  X2  —  ir]) 

—  A  {Xi  +  isy  X2  —  irf) 

—  A{x^~ie,X2  +  ir})] 


(82) 


A  similar  representation  can  be  written  for  A^ 

A     (^'     y  \  -     ^      r^^2^13(^2>  ^3) 

^3 1^3 »  ^2;  —        /  —,      ; 


\    r    da;{^i2(^3>^i) 
J  {K-xi-  xi)  -  z. 


Using  the  identity 
1 


J\.        X-i  '   ■  x^ 


n 


1  1 

+ 


(83) 


1 


[X^   -    Zi)    (X3  -    Zg) 


(84) 
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and   combining  with  (79)   and   (80),  we  finally  get  on  changing  over  to  the 
6"- variables,  the  Mandelstam  representation 


(85) 


Fig.  31.  The  boundaries  of  the  region  in  which  the  spectral  functions  are  non-zero. 


where  the  weight  functions  A.ij{sl ,  s-)  are  real  and  the  integral  in  each  s'  variable 
goes  over  the  region  of  positive  real  axis  extending  to  infinity.  (In  the  denominator 
a  small  negative  imaginary  part  is  to  be  understood.)  It  is  essential  to  specify 
the  region  where  the  spectral  functions  differ  from  zero.  Mandelstam  has  given 
the  precise  boundaries  for  different  processes  on  the  basis  of  unitarity  and  pertur- 
bation theoretic  arguments  (see  Fig.  31).  Cg^g^,  Cg^g^  and  Cg^g^  are  the  boundaries. 
^3aj  ^2a  ^^^  Si^  are  the  lower  limits  of  the  corresponding  variables,  s^,  Sg  and  s^^ 
occurring  in  equation  (85). 
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If  poles  occur  in  the  scattering  amplitude,  their  contributions  must  be  added 
to  (85).  Holding  one  s  variable  fixed  at  a  real  value  outside  its  energy  range  and 
carrying  out  one  of  the  two  integrations  in  (85),  we  arrive  at  the  ordinary  single- 
variable  dispersion  relations, 

.4  =  1  fd.; ^^l^l±^ll^  +  ^f<is-  ^^'^r^^'^^'  (86b) 

^  =  -L  Ids',  _Mili-^  +  _L  Td.;  AffilAlisL,  (86e) 

n  J  ^  <5j  —  «!  71  J  «2  ~  -^2 

For  this  there  should  be  the  corresponding  relations  for  the  absorptive  parts 
of  the  amplitude  A^ ,  A^  and  A^,  for  processes  I,  II  and  III  respectively 

./    ^13(44)  1      Tj     /     ^12(44) 


I 


7Z  J  Si^         .§2  71  J  oj  —  cj 

71  J  S.^-  S2  71  J  «3  -  ^3 

^  71  J       '^  S'  —  S.  71  J  ^         S'  —  Sn 


(87) 


Equations  (85),  (86)  and  (87)  imply  a  number  of  important  properties: 

(1)  The  ordinary  dispersion  relations  hold  for  arbitrary  values  of  momentum 
transfer. 

(2)  The  absorptive  parts  satisfy  dispersion  relations  as  functions  of  momentum 
transfer  for  fixed  energy,  i.e.  they  are  functions  of  momentum  transfer,  regular 
except  for  cuts  along  parts  of  the  real  axis.  This  is  a  much  more  general  result 
than  the  ellipses  of  Lehmann. 

(3)  Analytical  properties  of  the  partial  waves  as  a  function  of  energy  follow. 
The  partial  waves  are  always  boundary  values  of  analytic  functions  which  are 
regular  except  for  cuts.  The  dispersion  relations  for  fixed  angular  momentum 
transfers  are  particularly  powerful  because  they  allow  a  simple  incorporation  of 
the  unitarity  of  the  iS-matrix  in  the  problem. 

(4)  One  of  the  most  important  applications  of  the  Mandelstam  representation 
consists  in  extrapolating  scattering  amplitudes  to  poles  in  the  unphysical  region 
with  the  aim  of  determining  the  parities  of  particles  and  the  coupling  constants. 
It  was  mentioned  in  the  last  section  that  the  residue  of  the  pole  in  the  amplitude 
for  pion-nucleon  scattering  for  fixed  momentum  transfer  gives,  apart  from 
constant  factors,  the  pion-nucleon  coupling  constant.  We  can  expect  poles  in  the 
scattering  amplitudes  for  other  processes.  Perturbation  theory  suggests  a  broad 
rule  for  locating  the  poles.  If  the  two  incoming  and  the  two  outgoing  particles 
in  any  of  the  three  channels  of  a  diagram  can  be  "connected"  by  a  stable  single 
particle  state  of  mass,  m^,  (by  which  is  meant  that  the  initial  and  final  two -particle 
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states  can  both  assume  the  same  quantum  numbers  as  the  single  particle)  then 
there  will  be  a  pole  when  the  s-variable  corresponding  to  the  square  of  the  four- 
momentum  in  this  channel  is  equal  to  m^ .  The  poles,  though  Ij^ng  on  the  real  axis 
can  never  lie  in  the  physical  region  since  then  the  single  particle  could  decay  via 
strong  interactions  into  either  of  the  two-particle  states  to  which  it  is  coupled. 
Applying  the  above  criterion  there  should  be  no  poles  in  the  pion-pion  scatterino 
amplitude  if  electro-magnetic  effects  are  neglected  since  a  two-pion  state  has 
quantum  numbers  different  from  any  known  particle  except  a  photon.  The  two- 
nucleon  channel  has  three  poles,  one  from  each  channel.  The  2iV^-channel  gives 
rise  to  a  pole  at  s^  =  Mj)  corresponding  to  the  deuteron  state  while  the  N  —  A 
channels  give  rise  to  poles  at  s^  ==  /n^  and  Sg  =  jbi^  both  corresponding  to  the  single 
pion  state.  Chew^  has  proposed  a  way  of  determining  the  pion-nucleon  coupling 
constant  from  the  residue  of  the  pole  arising  from  the  single  pion  intermediate 
state.  The  difficulty  in  such  a  determination  with  the  energy  as  the  variable  is 
the  existence  of  extensive  non-physical  contributions  to  the  dispersion  relations 
which  make  practical  applications  difficult.  But  Chew  has  conjectured  by  con- 
sidering the  Feynman  diagrams  for  the  process  that  if  the  scattering  amplitude 
is  considered  as  a  function  of  the  momentum  transfer  Zl^  at  fixed  energy  then 
there  is  probably  a  pole  of  residue  g^  (the  same  coupling  constant  as  in  the  case 
of  the  pion-nucleon  interaction)  located  at  Zl^  ^  — ^2  where  jn  is  the  pion  rest 

2ZI2 
mass.  Since  cos  6=1 r^  we  see  that  though  the  point  A^  =  —  /j,^  is  always 

unphysical  corresponding  to  cos  6=1+    iZ    >  1  this  point  comes  nearer  to  the 

K 

physical  region  as  k^  increases.  For  practical  reasons  the  region  of  very  high 
energy  is  not  suitable  and  so  we  must  consider  the  possibility  of  bridging  by 
analytic  continuation  the  appreciable  gap  between  the  physical  region  and 
the  position  of  the  pole.  Chew  has  proposed  such  an  extrapolation  procedure 
which  can  be  justified  using  the  Mandelstam  dispersion  relations.  From  an  in- 
spection of  (85)  it  follows  that  if  the  fixed  variable  s^  is  in  the  physical  energy 
range  then  if  the  remaining  two  variables  s^  and  s^  are  expressed  in  terms  of  cos  6, 
all  the  singularities  in  the  complex  plane  are  outside  the  physical  interval.  Further 
the  nearest  branch  points  lie  beyond  any  poles  that  occur.  Thus  a  simple  poly- 
nomial extrapolation  can  be  made  from  the  physical  region. 

Finally  we  may  mention  that  a  much  more  ambitious  proposal  for  applying 
the  Mandelstam  representation  has  been  made^  which  consists  in  combining  it 
with  the  imitary  relations  to  obtain  a  dynamical  theory  of  low  energy  pion- 
nucleon  phenomena. 


1  G.  F.  Chew,  Phys.  Rev.  112,  1380  (1958). 

2  G.  F.  Chew  and  S.  Mandelstam,  Phys.  Rev.  119,  467,  478  (1960). 


CHAPTER  VI 

STRANGE  PARTICLES  AND 
THEIR  INTERACTIONS 

1.  STRANGE  PARTICLES! 


Introduction 


We  have    till    now    considered   the   interactions   of  the  following  elementary 

irticles :  the  photon,  electron,  positron,  neutron,  proton,  jr-meson,  ^-meson  and 

le  neutrino.  In  investigations  on  cosmic  rays,  new  elementary  particles  were 

[iscovered  which  were  later  produced  artificially  in  high  energy  accelerators. 

Before  we  discuss  the  nature  of  the  interactions  involving  these  new  particles, 

will  be  interesting  to  attempt  a  definition  of  an  ''elementary  particle".  It  is 

ither  curious  that  no  such  definition  can  be  given  in  terms  of  their  intrinsic 

bttributes  though  they  are  certainly  characterized  by  them.  In  the  absence  of 

iteractions,  the  only  intrinsic  attributes  of  a  particle  are  its  mass  and  spin.  In 

^Interaction,  new  attributes  can  be  revealed  namely  charge,  parity,  strangeftess 

and  isotopic  spin.  The  lifetime  of  a  particle  can  also  be  treated  as  an  attribute 

as  the  concept  of  lifetime  involves  that  of  an  interaction  between  the  particle 

and  its  decay  product.  There  does  not  seem  to  be  any  specific  criterion  for  the 

values  which  these  attributes  can  take.  Masses  of  the  known  elementary  particles 

range  from  0  (photon)  to  2642  times  the  electronic  mass  (the  S  particle).  There 

is  no  specific  limit  for  the  spin,  though  on  present  evidence  it  is  firmly  believed 

that  the  elementary  particles  have  spin  0,  J  or  1.  The  lifetime  of  known  elementary 

particles  range  from  10  "^'^  sec  to  oo.  Thus  it  looks  as  if  there  can  be  no  well  defined 

criterion  for  "elementarity"  though  we  are  quite  clear  that  atoms  and  molecules 

cannot  be  called  elementary  particles.  The  definition  is  made  still  more  difficult 

as  there  are  some  theorists  who  consider  certain  elementary  particles  as  composite 

particles,  i.e.  compound  or  bound  states  of  other  elementary  particles.  We  shall 

be  content  with  defining  an  elementary  particle  as  one  whose  properties  cannot 

be  better  understood  by  breaking  it  into  sub -components. 

We  shall  now  discuss  how  the  various  intrinsic  attributes  are  assigned  to  the 
new  particles  through  a  knowledge  of  their  interactions.  The  new  particles  are 
the  following : 

A^  I\  ro,  Z-,     E^  E-,     K\  K\  K^,  K-. 

The  known  interactions  involving  their  production  and  decay  are  given  below. 


1  K  mesons  and  hyperons,  their  strong  and  weak  interactions  by  R.  H.  Dalitz,  Rep. 
Progr.  Phys.  20,  163  (1957).  Also  Hyperons  and  heavy  mesons,  systematics  and  decay  by 
M.  Gell-Ma:nn  and  A.  H.  Rosenfeld,  Ann.  Rev.  Nud.  Sci.  7,  407  (1957). 
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Some  typical  production  reactions  and  decay  schemes  of  strange  particles' 


Name  of 
the  particle 

Production  reaction 

Decay  scheme 

Lifetimes  in  sec  *" 

K^ 

7i-  +  p->2:-  +  K^ 

A'+->7r+  +  :nE0 

1.224  ±0.013x10-^ 

n+  +  p->Z'+  + A"-^ 

K- 

n-  +  p-*n  +  K^+  K- 

The  decay  modes  can  be  ob- 

p +  p-^K^  +  K- 

tained  from  those  of  K+  by 

changing     the    charge    and 

i-eplacing  v  by  v. 

A'o 

ii-  +  p^2:^  +  K^ 

K\->7l+  +  7l- 

1.00±0.04xl0-i» 

^A^  +  K^ 

-^  71+  +  71-  +  71^  (rare) 

-►7l=F  +^±  +V{V) 

-►ttT  +  e±  +  v{v) 

A'» 

7i-  +  p^n  +  K^  +  K^ 

K\-^  71+  +  71-  +  71^ 

6.llJ;?xlO-« 

p-^p-^K^  +  K^ 

-^7lT  +  ^±  +v{v) 
-^71+  +  e±  +v{^ 

.P 

A-  +  p->yl»  +  7r0 

A*^->p  +  7z- 

2.51  ±0.09  xlO-i* 

2:'-{-p->A^-^n 

->ji^  +  n 

p  +  p-^K^  +  A^  +  p 

-^p  +  fi-  +  vi^,) 

p  +  n-^  K°  +A^  +  p 

->p  +  e-  +v{t) 

->  A+  +  /I"  +  7i 

p  +  p^A^  +  A^ 

E' 

K-±p->Z<>-i-7T^ 
p  +  P^K+rZ^  +  P 

p  +  n-*  K^  +  E^  -rp 
-yK^  +  E^+n 

p  +  p-^E^-rE^ 

Z^  ->  .40  +  y 

<0.1x  10-10 

E- 

K-  +  p^E^  +  71- 

E+->p  +  .T» 

0.81  +  nixl0-i» 

p^-p-^K^-rE^-^p 

->  w  +  .t* 

-^K"-  +  E^  +  n 

p  +  p^E'^  +  E^ 

s- 

K-  +  p-^E-  +  71+ 
p  +  n->  K+  +  E'  +  p 
p^p^E-  +  'E- 

i7~  -►  w  +  :^~ 

i.6i:2:j;xio-i« 

-- 

7i-  +  p->E-  +  K+  +  K'' 

E-->A'  +  7i- 

i.3lS:|x  10-10 

—0 

Ti-^p^E^  +  K^+K^ 

I^-^A^  +  7l^ 

1.5x10-" 

*  (1)  Though  we  have  not  written  down  71-p  collision  reactions  for  hyperons,  they  must 
also  be  treated  as  production  reactions  for  hyperons.  In  a  similar  way  the  N-N  collisions  also 
produce  A^-mesons. 

(2)  We  have   only  written  reactions  involving  p.  We  can  also  write  71-N  collisions 
and  K-p  absorption  reactions. 

^  G.  A.  Snow  and  M.  M.  Shapiro.  Rev.  Mod.  Phys.  33,  231  (1961). 
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The  most  striking  feature  of  the  interactions  involving  the  new  particles  is 
their  copious  'production  which  is  irreconcilable  with  their  long  lifetimes.  In  order  to 
understand  this  contradiction,  Pais  proposed  the  idea  of  "associated  production" 
i.e.  the  hyperons  and  iC-mesons  are  always  produced  in  association  while  the 
particles  decay  separately.  Thus  the  interactions  involving  production  may  be 
considered  different  from  those  involving  decay,  the  former  being  assumed  to  be 
strong  while  the  latter  is  weak.  The  strength  of  the  interactions  involving  pro- 
duction is  of  the  same  order  of  magnitude  as  that  of  the  meson-nucleon  interaction 
while  the  decay  interactions  are  found  to  be  much  weaker  than  the  electro- 
magnetic interactions.  The  various  interactions  are  characterized  not  only  by 
their  strength  but  by  other  characteristics  as  well. 

Before  dealing  with  these  interactions  in  detail  and  comparing  them  with  the 
known  interactions  discussed  before,  it  will  be  convenient  to  classify  the  elementary 
particle  interactions  as  follows : 

(i)  Strong  interactions:  This  group  includes  the  forces  responsible  for  the 
production  and  scattering  of  nucleons,  pions,  hyperons  and  iC-mesons.  It  is 
characterized  by  a  coupling  constant  of  the  order  of  magnitude  unity  and  the 
conservation  of  Q,  I  and  I^ ,  the  charge,  the  total  isotopic  spin  and  its  z-component 
respectively. 

(ii)  Electromagnetic    interactions:    These    are    characterized    by   the    coupling 

constant  — —  =  1/137  and  the  conservation  of  Q  and  7^. 

Jh  C 

(iii)  Weak  interactions'^:  These  are  characterized  by  the  coupling  constant 
g'^jhc  ~  10"^*  and  the  conservation  of  Q. 

(iv)  Gravitational  interactions:   The  gravitational  interaction  is  characterized 
^      by  a  coupling  constant— 7 —  =  2xl0~^^    where    G    is    Newton's   gravitational 

constant  and  m  chosen  to  be  the  mass  of  the  proton.  The  gravitational  inter- 
action will  not  be  further  discussed. 


Isotopic  spin  and  strangeness 

We  now  make  a  physically  satisfying  assumption  that  no  elementary  particle 
can  have  multiple  charge,  i.e.  all  the  elementary  particles  are  either  neutral  or 
singly  charged.  The  relationship  between  isotopic  spin  and  charge  is  given  in  the 
case  of  nucleons  by 

Q-h  +  ^U  (1) 

and  in  the  case  of  mesons  by 

Q  =  Iz-  (2) 

^  The  lifetimes  of  weak  decays  however  have  a  very  wide  range  from  0. 75  x  10"^^  sec  to 
10^  sec.  The  universal  weak  interaction  can  explain  this  very  sensitive  dependence  of  the 
lifetime  for  decays  on  available  energy  (115  MeV  for  27+  and  0-75  MeV  for  neutron). 
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More  generally  we  can  write  the  relationship  between  Q^  and  /^  of  any  elementary 
particle  as  follows : 

Q  =  i.  +  vk-  (3) 

The  total  charge  Q  and  the  total  I^  of  a  system  of  elementary  particles  are  additive 

and  thus  the  relation  between  them  should  he  linear}  But  to  convince  ourselves  that 

in  the  relation  between  Q  and  /^  the  coefficient  of  I^  should  be  unity  we  adopt 

the  following  argument.  Consider  the  strong  interaction  involving  pions  and 

nucleons 

pions  +  nucleons  ->-  A  -\-  B. 

Two  pion-nucleon  systems  differing  in  7^  (and  not  in  y)  by  unity  will  also  differ 
in  charge  by  unity.  Since  charge  is  conserved,  the  corresponding  reaction  products 
in  the  two  cases  will  differ  by  unity  in  their  charge.  Let  the  relation  between 
Q  and  7^  for  systems  A  and  B  be  written  as : 

Since  Iz  is  also  conserved,  we  have  61^  =  dQ,  i.e. 

SI,  =  dif  +  dif  =  OCA  dif  +  (XB  SIf. 

This  should  be  true  for  all  values  of  dQ.  Hence 

O^A  =  (^B=   1  •  (6) 

Since  we  have  barred  the  existence  of  multiply  charged  particles,  the  7  value 
of  single  particles  is  limited  to  0,  ^/g  and  1.  Since  there  are  no  charged  particles 
of  about  the  same  mass  as  A^,  we  can  assume : 

I(AO)  =  0;       i,(^«)  =  0.  (7) 

Consider  the  reaction 

7z- +  p->A^  +  K^.  (8) 

The  left-hand  side   can  have  7  =  ^/g  or  ^/g.  The  right-hand  side  should  have  the 
same  value  of  7  and  7^  due  to  conservation  of  7  in  strong  interactions.  Therefore, 
from  (7)  we  conclude 
HK")^-'!,;       /,(^»)  =  -V2-  (9) 

1  By  Q  is  here  meant  the  ratio  of  the  amount  of  charge  to  charge  of  electron  and  this  is  an 
integral  number  for  all  particles.  The  question  arises  whether  we  could  assign  such  integral 
charge  quantum  numbers  to  the  various  particjes  but  conceive  of  a  diflPerence  in  the  amount 
of  charge  say  between  a  proton  and  an  electron  as  in  some  cosmological  theories  where  a  slight 
difference  is  postulated.  Would  this  imply  that  the  amount  of  charge  is  not  conserved  in 
processes  like  ^  decay  though  the  charge  as  an  integral  number  as  defined  by  (3)  is 
conserved ;  or  would  it  be  possible  to  observe  processes  like  e~  ->  y  +  y  and  e'  -^  2v  +  v 
whose  lifetime  is  of  the  order  of  10^^  years?  (Marshak  and  Sudershan,  Elementary  Particles, 
Interscience  1960). 

2  See  also  "Elementary  Particles  and  Weak  Interactions"  by  T.  D.  Lee  and  C.  N.  Yang, 
Lectures  at  Brookhaven  National  Laboratory,  p.  5  (1957).  We  have  to  assume  that  y  is  inde- 
pendent of  7j. 
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From  the  above  discussion,  the  corresponding  particle  with  I^  =  '^U  must  be  the 
positively  charged  particle  K^,  i.e.  K^,  K^  form  an  isotopic  doublet.  By  applying 
the  charge  conjugation  operator  to  K^,  K^,  we  can  generate  the  antiparticles 
K-  and  X«  with 

I{K')  ^  V2;         HK-)  =  V2;        I~Am  =  V2;  J^ziK-)  =  -1/2-  (10) 

We  have  thus  uniquely  determined  the  /  spin  of  the  K  particle.  The  pairs  K+, 
K^  and  K^,  K~  are,  like  nucleons,  spinors  in  isotopic  spin  space. 

Consider  the  reaction 

71- +  p^E- -\- K-^.  (11) 

Conservation  of  I^  on  both  sides  leads  to 

/,(i:-)  =  -i;/(i:-)=i.  (12) 

Thus  E'^,  H^,  E~  should  form  an  isotopic  triplet,  i.e.  E  particles  transform  like  a 
vector  in  charge  space  like  yr-mesons.  As  regards  the  cascade  particle  S,  from  the 
fact  that  S  is  produced  along  with  two  neutral  iT-mesons  in  n-  N  collisions, 
we  deduce  that  S'  corresponds  to  /^  =  — ^/g  of  an  isotopic  doublet.  Thus  7(5)  =  ^j^ ; 

I  AS')  =  V2.  (13) 

Having  assigned  /^  =  0  for  A^,  we  are  inevitably  led  to  the  consequence  that 

a  reaction  which  conserves  charge  need  not  conserve  I^ .  For,  consider  the  reaction 

7i+  7i->ylo  +  ylo.  (14) 

I       which  conserves  charge  but  not  I^  since  1^=  — 1  for  the  L.H.S.  and  0  for  the 

Y 
R.H.S.  But  we  know  that  Q  and  I^  are  related  by  a  linear  relation  ^  =  /^  +  — . 

Since  the  above  reaction  conserves  the  baryon  number  N ,  it  is  clear  that  Y  cannot 
be  just  identified  with  N' .  The  way  out  of  this  difficulty  was  suggested  by  Nishijima 
and  Gell-Mann^  by  defining  a  new  quantum  number  S  with 

Y==N^S  (15) 

each  particle  being  assigned  a  particular  S  value.  Thus  conservation  oiQ  implies 

o 

only  conservation  of  /^  +  ~^  together  and  not  separately.  The  calculation  of  S 
from  (15)  is  easy  and  we  collect  the  /,  I^,  Y,  S  values  in  a  table  of  reference. 


1  If  baryon  number  is  absolutely  conserved  there  should  be  no  physical  process  by  which 
the  proton  can  decay  into  lighter  particles.  The  search  for  this  has  set  a  lower  limit  of  3  x  10^* 
years  for  proton  decay  by  any  mode. 

2  M.  Gell-Mann,  Phys.  Rev.  92,  833  (1953);  NuovoCim.  Suppl.  2,  848  (1956).  T.  Nakano 
and  K.  Nishijima,  Progr.  Theoret.  Phys.  10,  581  (1953).  K.  Nishijima,  Progr.  Theoret.  Phys. 
13,  285  (1955). 
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There  exists  an  interesting  explanation  for  the  strangeness  quantum  number.^ 
We  have  ah-eady  estabUshed  that  the  conservation  of  charge  can  be  interpreted 
as  a  consequence  of  in  variance  under  a  gauge  transformation.  We  may  similarly 
introduce  a  baryon  gauge  transformation  applicable  to  nucleons  and  hyperons, 
i.e.  the  baryon  field  \p  is  replaced  by  e*'^-'^^  where  Xj^  is  a  c-number.  As  before  we 
can  easily  see  that  the  existence  of  such  a  transformation  will  lead  to  baryon 
charge  iV,  the  conservation  of  which  follows  from  the  invariance  of  the  Lagrangian 
under  such  a  transformation. 

Observing  that  nucleons  and  (K'^,  K^)  are  iso-spinors,  we  introduce  another 
gauge  transformation  for  these,  i.e.  \p-^e^^^\p.  For  (E" ,  S^)  which  is  also  an  iso- 
spinor  we  postulate  the  transformation  ip^^-e'^^^ip.  This  gauge  is  called  the  "hyper- 
charge  gauge".  The  conserved  quantity  due  to  this  transformation  is  called  the 
hypercharge  Y.  The  particles  "p,  n,  K^,  K^  are  called  isofermions,  S~  E^  and  the 
anti-particles  of  isofermions  are  called  anti-isofermions.  In  any  reaction  the 
number  of  isofermions  minus  the  number  of  anti-isofermions  will  be  a  constant  of 
motion.  Similarly  the  number  of  baryons  minus  the  number  of  anti-baryons  will 
remain  a  constant.  The  importance  of  these  two  gauges  is  revealed  through  the 
interesting  relation  between  strangeness  8,  the  hjrpercharge  Y  and  the  baryon 

number  N, 

S=  Y  —  N.  (16) 

It  should  be  noted  that  the  fermion  or  boson  character  of  a  particle  and  its  isofermion 
or  isohoson  character  are  entirely  independent.  The  names  "isofermion"  or  "isoboson" 
do  not  imply  anything  about  the  statistics  obeyed  by  the  particle. 

We  have  already  remarked  on  the  existence  of  four  iL-particlesiT +,  iC^  and  K^,  K~ 
as  a  pair  of  iso-doublets  (see  however  p.  221).  {K~ ,  K^)  is  the  anti-pair  oi{K'^,K^). 


1  B.  D'EsPAGNAT  and  J.  Prentki,  NilcI.  Phys.  1,  33  (1956).  See  also.  "Some  Theoretical 
Aspects  of  the  Strong  Interactions  of  the  New  Particles"  by  B.  D'Espagnat  and  J.  Prentki, 
Progr.  Elementary  Particle  and  Cosmic  Ray  Physics  4,  1  (1958). 
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It  is  interesting  to  note  that  though  K^  is  a  neutral  particle  it  has  a  distinct 
anti-particle  as  contrasted  with  the  case  of  7t^  which  is  its  own  anti-particle.  This 
is  due  the  fact  that  the  strangeness  of  K^  is  +1  while  that  of  n^  is  zero.^  This 
feature  has  some  interesting  consequences  which  will  be  discussed  below.  We 
can  show  that  K^  and  K^  cannot  be  identical  or  in  other  words,  K  particles  cannot 
form  an  isotopic  triplet  like  the  tt- meson.  Consider  the  virtual  reaction 

n<^A^  +  K^.  (17) 

^  +  XO<-^ylo.  (18) 


This  implies 


If  Zo  =  K"",  then  n 
Thus 


Xo^^/Lo  is  allowed. 


A^  +  .40, 


(19) 


a  reaction  which  has  not  been  observed  in  spite  of  its  low  threshold. 

We  will  now  list  some  of  the  results  which  follow  as  natural  consequences  of  the 
concept  of  strangeness.  It  is  clear  that  if  we  require  the  conservation  of  ^,  I  z 
and  N  for  strong  and  electromagnetic  interactions,  then  we  obtain  the  selection 
rule  A8=0  (i.e.  strangeness  is  conserved).  We  have  already  mentioned  that 
electromagnetic  interactions  violate  the  conservation  of  total  isotopic  spin. 

If  we  consider  collisions  of  pions  and  nucleons  the  initial  system  has  8=0 
and  the  conservation  oiS  clearly  requires  the  production  of  a  iT  particle  {8  =  +1) 
whenever  a  hyperon  (8  =  ^—\)  is  produced.  This  feature  in  the  production  of 
strange  particles  is  called  "associated  production"  and  was  originally  invoked  by 
Pais  in  order  to  reconcile  the  copious  production  with  the  long  lifetimes  of  the 
strange  particles.  We  can  equally  understand  the  higher  threshold  for  the  production 
of  K~  in  TT  —  N  OY  N  —  N  collisions  as  contrasted  with  K^ ,  for  K~  can  only  be 
produced  along  with  K^  as  there  are  no  hyperons  with  /S  =  +  1 .  Associated 
production  with  antihyperons  for  K~  is  not  possible  in  (n~  —  p)  collisions  as  it 
implies  the  violation  of  baryon  conservation.  If  we  now  turn  to  the  reaction  of 
K+  particles  with  baryons,  we  find  that  K^  particles  of  moderate  energy  inter- 
acting with  nucleons  can  only  be  elastically  scattered  due  to  the  absence  of 
hyperons  with  positive  8.  Since  8  =  —  1  for  the  hyperons,  the  K~  particle  can 
undergo  a  variety  of  interactions  with  nucleons  at  the  same  energy  besides 
elastic  scattering.  In  a  similar  way,  the  interaction  of  the  A^  particle  with  a  nucleon 
can  lead  only  to  elastic  scattering  while  the  Z  particle  will  interact  rapidly  with 
neutron  or  a  proton  at  all  energies  transforming  to  a  A^  particle  with  the  release 
of  approximately  80  MeV  of  kinetic  energy  {^^H  —  A^  mass  difference)  i.e. 

r-  +  v-^A""  +  n.  (20) 


+  P 


All  the  particles,  i.e.  hyperons,  K-  and  ^r-mesons,  remain  stable  when  only  strong 
and  electromagnetic  interactions  are  present  with  three  exceptions  namely  the  decays 
of  71^,  Z^,  E^.  Decays  lil^e  n^ p  -{-  n' ,  A^ -^ p  +  K~  or  Z^ ->  A^  +  71^  are  forbidden 


^  It  is  to  be  noted  that  even  though  neutron  has  a  distinct  anti-particle,  it  cannot  go  into 
an  anti-neutron  because  baryon  number  conservation  is  valid  even  in  weak  interactions. 

15* 


L 
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by  conservation  of  energy  though  they  conserve  strangeness  and  hence  they 
can  only  be  virtual  processes.  The  rule  AS  =  0  prevents  the  energetically  possible 
decays  like  A^  ^  p  +  7i~  or  K^  ->  n^  +  71^  through  strong  interaction. 

The  71^  meson  has  zero  charge  and  zero  strangeness  and  thus  the  transition 
of  the  Tif^  to  the  vacuum  can  occur  with  the  emission  of  y-rays,  i.e.  tt"  ->  y  +  y. 
It  is  to  be  noticed  that  this  reaction  requires  a  change  of  isotopic  spin  by  one 
unit  which  is  accomplished  by  the  electromagnetic  coupling.  Since  the  electro- 
magnetic interaction  is  proportional  to  the  charge,  it  is  linear  in  /^  and  hence  it 
transforms  in  the  isotopic  spin  space  like  a  scalar  plus  the  z  component  of  a  vector. 
If  we  treat  the  electromagnetic  coupling  as  a  perturbation,  whenever  it  occurs  it 
can  change  the  total  isotopic  spin  either  by  zero  or  by  one  unit,  i.e..  |zj  /|  ==  0  or  1. 
Turning  to  E^  we  find  that  the  transition  Z^-^A^  +  y  satisfies  AQ=-0,  AS=  0, 
\Al\  =  1 .  Hence  the  transition  will  take  place  through  electromagnetic  coupling. 
A  rough  theoretical  estimate  of  lifetime  can  be  obtained  if  we  assume  a  magnetic 
dipole  transition  between  particles  of  spin  1/2  {Z^  amdA^)  with  a  transition  magnetic 
moment  approximately  equal  to  the  neutron  magnetic  moment.  We  then  have 
lifetime  of  2"°  :^  5  x  10 "^^^  sec.  Thus  Z^  undergoes  a  decay  too  rapid  to  be  observed 
directly.  As  a  result  of  this  rapid  radiative  decay,  Z^  will  rarely  undergo  the  slow 
pion  decay  Z^->p  +  7i~  +  110  MeV. 

All  the  baryons,  anti-baryons  and  mesons  which  are  stable  under  strong  and 

electromagnetic  interactions  decay  with  finite  lifetimes  into  other  particles.  The 

interactions  which  cause  these  decays  are  called  "weak  interactions"  because  the 

coupling  strength  for  these  interactions  is  of  the  order  10"^*.  The  strange  particles 

which  are  produced  in  strong  interactions  should  decay  very  rapidly,  i.e.  they 

h 
should  have  a  lifetime  '^ ^'^  lO'^^sec.  However  lifetimes  of  these  particles 

range  mostly  from  10 ~^  to  10"^^  sec.  It  is  this  unexpectedly  long  life  time  of  the 
strange  particles  that  caused  theoretical  physicists  to  look  for  a  selection  rule  which 
prevents  rapid  decay  of  strange  particles.  It  was  in  a  search  for  an  explanation 
of  these  seemingly  irreconcilable  facts  that  Nishijima  and  Gell-Mann^  postulated 
the  strangeness  quantum  number.  On  the  basis  of  this  scheme  of  assignment  of 
strangeness  S,  the  weak  interactions  violate  conservation  of  S  as  is  seen  from  the 
table  (p.  226)  i.e.  AS  =  ±1.  The  decay  S~  -^n  +  7z-  which  is  energetically  more 
favourable  than  the  observed  decay  A^  +  7t~  is  perhaps  forbidden  because  the 
decay  requires  a  change  of  strangeness  by  two  units.  Recent  developments 
indicate  that  all  the  weak  interactions  violate  also  the  symmetry  under  parity 
and  charge  conjugation  operation  (to  be  discussed  later). ^  Thus  the  study  of 
strange  particle  interactions  divides  itself  into  two  classes, 

(1)  Strong  interactions  involving  productions, 

(2)  Weak  interactions  involving  decays. 


1  M.  Gell-Mann,  Phys.  Rev.  92,  833  (1953) ;  Nuovo  Cim.  Suppl.  2,  848  (1956).  K.  Nishijima, 
Progr.  Theoret.  Phys.  13,  285  (1955). 

2  See  chapter  V,  section  2. 
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Surprising  as  it  may  seem,  it  is  the  attempt  at  a  theorj^  of  weak  interactions 
that  has  led  to  some  definite  and  interesting  conclusions,  while  the  theory  of 
strong  interaction  is  yet  in  a  primitive  and  speculative  stage.  We  shall  first  deal 
with  the  decays  of  K  particles,  the  study  of  which  led  to  the  postulate  of  non- 
conservation  of  parity  with  its  far  reaching  consequences.  We  shall  then  examine 
in  some  detail  the  theory  of  weak  interactions  particularly  on  the  basis  of  a 
universal  coupling  between  four  fermions. 

By  considering  the  strangeness  quantum  number  we  have  been  able  to  classify 
interactions  but  it  does  not  lead  to  any  detailed  predictions  of  cross -sections  or 
quantitative  relationships  between  various  processes.  However  the  assumption 
of  charge  independence  for  interactions  involving  strange  particles  will  give  rise 
to  many  quantitative  relationships  between  various  interactions  connecting 
given  multiplets.  Various  equalities  and  inequalities  resulting  from  charge  in- 
dependence in  elementary  processes  can  be  derived.^  We  will  now  discuss  a  few 
examples  of  these  relations  relating  to  the  capture  of  K  particles  by  nucleons. 

Consider  the  production  of  27 particles  on  capture  of  iT  by  nucleons.^  The  initial 
K~  +  p  system  has  two  particles  each  with  /  =  1/2  and  since  the  total  I^  =  0, 
the  system  will  be  a  mixture  of  /  =  0  and  /  =  1  states.  Let  a^  and  a^  denote  the 
reaction  amplitudes  for  states  1  =  0  and  I  =  1 .  In  the  final  state  we  require 
/  =  0  or  1  and  we  should  decompose  a,  E+n  system  into  7=1  and  7=0  states 
for  charge  independence  requires  that  7  should  be  a  constant  of  motion  dm*ing 
the  reaction,  i.e.  the  reaction  amplitudes  depend  only  on  total  7.  Hence  the  final 
E  +  71  state  will  be 


1 


(21) 


|/3        j/2 

where  we  have  made  use  of  the  necessary  Clebsch-Gordon  coefficients  to  get  the 
various  substates  arising  from  the  composition  of  two  7=1  particles  (cf.  Ap- 
pendix 2).  Thus  {K~  +  p)  reaction  will  lead  to  the  following  ratio  for  the  pro- 
duction of  charged  and  neutral  i7's 

Remembering  that  CTq  and  a^  are  generally  complex,  charge  independence  provides 
us  with  the  inequality 

]ln  [E^)  +  ]ln  {E-)  ^  2  }/?i  (Z^)  ^  |  }ln  {E^)  -  ^/n  {E-)  \ ,  (23) 

where  n{E)  denote  the  number  of  E  particles  of  specified  charge. 

1  D.  Feldman,  Phys.  Rev.  103,  254  (1956). 

2  K.  M.  Case,  R.  Karplus  and  C.  N.  Yang,  Phys.  Rev.  101,  358  (1956). 
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The  capture  of  K~  by  neutrons  can  go  only  through  7=1  state  since  /^  =  — 1 
and  hence  the  final  H  -{-  ti  state  is 

a,0,__,{l7t)  =  -^[U.{i:)U„{7i)-U,(i:)U_{7t)].  (24) 

Thus  we  have 

(i:-:^):(i:»;i-)  =  V2«!:V2«f-  (25) 

If  K~  is  incident  on  He*  or  C^^^  [^  interacts  with  the  neutron  or  the  proton  with 
equal  probability.  The  number  of  27  particles  of  given  charge  can  be  obtained  by 
averaging  the  probabilities  given  in  (22)  and  (25)  and  we  thus  have 

n{Z^)  +  n(I-)^2niy,  (26) 

a  relation  independent  of  the  values  of  a^  or  a^ .  The  difficulty  of  identifying  -T" 
renders  verification  of  even  this  simple  relation  by  experiment  difficult.  One  of 
the  simplest  predictions  is  due  to  Lee  ^  who  has  discussed  the  absorption  of  K~ 
particles  by  He*  nuclei.  We  note  that  since  He*  nucleus  has  7=0,  the  initial 
system  has  an  isotopic  spin  /  =  ^/g  and  thus  there  is  no  mixture  of  states  in  the 
initial  system.  We  can  deduce  that  the  branching  ratios  between  reactions  (a)  and  (b) 
should  be  as  2  :  1 

f  I-  +  He3  {  A^  +  7i-  +  He3  (a) 

K--\-lle*^\   ^  or     !  ^  ^  (27) 

^  [  2:0  +  H3  1  yio  +  7r«  +  He3  (b) .  ^     ' 

Experimental  confirmation  of  such  relations  will  strengthen  the  postulate  of 
charge  independence  in  interactions  involving  the  strange  particles.^ 


2.  DECAYS  OF  K-PARTICLES^ 

Charged  as  well  as  neutral  iT-mesons  decay  by  a  variety  of  modes  and  the 
multiplicity  of  decay  modes  initially  created  quite  a  confusion  in  the  identification 
of  the  particles.  At  the  beginning,  each  physically  different  mode  of  decay  was  pheno- 
menologically  characterized  as  a  different  particle.  It  is  now  weU  established  that  all 


1  T.  D.  Lee,  Phys.  Rev.  99,  337  (1955). 

2  Saktjrai  first  noted  [see  Phys.  Rev.  107,  908  (1957))  that  if  charge  independence  is  valid 
for  the  reactions  71+  +  ;? ^  i7+  +  X+ ;  7i~  +  p->  2^  +  K^  and  n'  +  p^  U'  +  K"^  the  relation 
among  the   differential   cross -sections   for  i7+,    U^   production    is    [2dor(2'")]J  ^  [d(T(Z+)]J 

+  [da(i7-)]2 .  Earlier  experiment  by  Brown  et  al.  {Phys.  Rev.  107,  906  (1957))  indicated  that 
this  relation  seemed  to  break  down  for  the  backward  produced  Z  's.  However  a  recent  experi- 
ment by  Crawford  et  al.  {Phys.  Rev.  Lett.  3,  394  (1959))  has  verified  that  this  relation  holds 
good.  Their  results  also  show  that  this  relation  is  more  an  equality  and  that  most  of  the  2"° 
and  IJ~  are  produced  in  /  =  ^/g  state. 

3  For  complete  information  on  experimental  aspects,  see  the  Study  of  Elementary  Particles 
by  the  Photographic  Method  by  C.  F.  Powell,  P.  H.  Fowler  and  D.  H.  Perkins,  Pergamon 
Press  (1959). 
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b 


the  different  mesons  have  the  same  mass  and  lifetime.  We  list  in  the  table  below 
the  available  information  on  the  decay  modes  of  charged  K^.  Corresponding 
decay  modes  exist  for  the  K~ ,  suitable  changes  being  made  in  the  charges  of  the 


Name 

Decay  scheme 

Branching  ratio 
(per  cent) 

0+ 

7rO  +  7r+  +  220MeV 

25 

T 

71+ ±71- +  71^  +    75MeV 

6 

T 

7r+  +  ^°  +  7rO  +  84MeV 

2 

K,,  2 

Z^+i'  +  SeSMeV 

58 

Kfxz 

^+  +  7rO  +  v  +  253MeV 

5 

a;  3 

e++7r°  +  v4-358MeV 

4 

decay  products.  The  most  frequent  mode  in  the  decay  of  charged  K  is  K^2y  i-^- 
K'^{K~)->jbi^{/j,~)  +  v{v).  The  analogous  decay  mode  of  K  into  an  electron  and  a 
neutrino,  i.e.  K-^(K'')  ->  e+ [e-)  +  v{v)  (i.e.  K^^)  seems  to  be  rare  or  absent.  Ex- 
periments on  K^  stopping  in  emulsion  with  emission  of  c"*"  indicate  that  the  ratio 

— ; is  probably  >0.1  per  cent.  The  existence  or  absence  of  Aigg  decay  mode  has 

important  implications  as  we  shall  see  later  when  we  discuss  the  Universal  Fermi 
interaction.  It  is  to  be  noted  that  there  is  no  K  decay  which  involves  neutrino 
pair  emission  as  in  the  case  oi  [j,  decay. 


The  T+-0+  puzzle  1 

The  idea  that  the  various  decay  modes  of  K+  particles  empirically  denoted  by 
different  names  like  t"*",  0^  etc.,  are  merely  manifestations  of  a  single  particle, 
brought  in  its  wake  a  puzzle  which  troubled  theoretical  physicists  for  more  than 
a  year.  If  0+  and  t"*"  are  the  same,  their  intrinsic  parity  should  also  be  identical 
and  if  the  conservation  of  parity  is  considered  valid,  then  since  the  27C  and  3jr 
decay  modes  indicate  that  the  parities  of  d+  and  t+  are  different,  they  cannot 
be  the  same  particle.  However  if  0+  and  t"*"  are  different  particles,  there  is 
no  explicable  reason  why  their  masses  and  lifetimes  should  be  identical. 
If  the  postulate  of  conservation  of  parity  is  given  up  the  puzzle  of  course  does 
not  exist.  In  view  of  the  fundamental  importance  of  the  study  of  the  two  decay 
modes  which  led  Lee  and  Yang  to  question  the  conservation  of  parity  in  weak 
interactions,  we  give  a  detailed  discussion  of  the  analysis  by  Dalitz.  This  is  based 
on  the  assumption  that  parity  is  conserved  in  decay  interactions. 

If  the  0+  particle  has  spin  j,  since  we  are  considering  its  decay  at  rest,  it  is 
clear  that  the  relative  motion  of  the  two  pions  must  posses  an  orbital  angular 
momentum  /  since  the  pions  are  spinless.  Assuming  parity  to  be  conserved  in 


1  See  R.  H.  Dalitz,  loc.  cit.,  pp.  195. 


232  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

the  decay,  this  decay  will  be  allowed  only  if  the  6^  particle  has  an  intrinsic 
parity  co  given  by 

w={-iy.  (1) 

Thus  the  0^  particle  can  have  the  spin-parity  assignments  (0+)(l~)(2+)(3~).^ 

All  the  three  pions  in  the  t^  decay  mode  may  be  followed  to  the  end  of  their  range 
and  this  allows  us  to  establish  fully  the  configuration  of  the  decay.  The  event  has 
the  well-known  Q  value  ^'TS.l  MeV.^  This  information  along  with  the  assumption 
of  the  conservation  of  momentum  helps  us  to  fix  the  configuration  of  the  decay 
exactly  even  if  only  two  of  the  three  pions  are  observed.  The  configuration  is 
conveniently  specified  by  two  momenta;  p  the  momentum  of  the  7r~  in  the  initial 
rest  system  of  t+  and  q  the  momentum  of  one  of  the  two  7r+  in  their  centre  of 
mass  system.  We  have  the  relation  (see  Appendix  5). 

g=i(if|-3m2-2^,J57//.  (2) 

where  Ep  =  (m^  +  p^Y'*  is  the  total  tt"  energy.  If  d  denotes  the  angle  between  p 
and  q,  cos  6  is  related  to  the  observed  momenta  by  the  expression 

pq       [        XM.-Epl]  p  q 

Pi,  coj  and  p^,  CO2  being  the  observed  momenta  and  energies  of  the  two  jf^. 

In  order  to  discuss  the  spin-parity  assignments  of  t"*",  let  us  denote  by  L  and  /, 
the  orbital  angular  momenta  associated  with  the  linear  momenta  p,  q  respectively. 
The  two  positive  pions  are  identical  Bose  particles  and  hence  they  must  be 
described  by  a  wave  function  which  is  symmetric  under  the  interchange  of  the 
two  particles.  This  is  possible  only  if  their  relative  motion  has  even  parity  and 
so  I  must  be  even.  Taking  into  account  the  odd  intrinsic  parity  of  the  three  pions, 
conservation  of  parity  in  t+  decay  requires  that  the  intrinsic  parity  co  of  t"*"  be 

co  =  -(-l)^.  (4) 

We  note  therefore  that  the  intrinsic  parity  is  determined  by  the  orbital  angular 
momentum  of  the  odd  negative  pion.  Further  if  t+  has  a  spin  j,  then  from  the  usual 
vectorial  addition  of  angular  momenta,  we  must  have 

J=L+l.  (5) 

If  j  =  0,  then  L  =  I  i.e.  L  is  even  and  the  parity  of  t+  is  necessarily  negative. 
Thus  for  zero  spin  r  ^  6.  For  higher  values  of  j,  we  can  make  the  following 
statements. 

(a)  If  there  exists  an  S  wave  :7i;~  meson,  its  parity  is  odd  and  its  spin  is  even. 
Since  L  =  0,  co  =  1  and  j  =  I  and  is  even.  Thus  the  probability  density  for  t"*" 


1  Associating  0+  with  6^  which  decays  into  £71°,  we  can  conclude  that  0+  will  have  only 
even  spin  since  27z^  can  have  only  even  orbital  angular  momentum. 

^  By  Q  value  we  refer,  as  usual,  to  the  kinetic  energy  available  for  the  particles  in  the 
final  state. 
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decay  events  must  vanish  for  the  decay  configuration  in  which  the  :7r"  meson 
is  at  rest,  unless  the  t"*"  parity  is  odd  and  its  spin  is  even.  It  should  be  noted  that 
for  even  spin  6  has  even  parity. 

(b)  If  the  two  7r+  mesons  have  a  relative  S-wave  motion,  i.e.  Z  =  0 
then,  CO  =  —  (— 1)^  =  —  (— 1)^  This  implies  that  if  t^  =  6+  the  probability  density 
for  T+  decay  events  must  vanish  for  the  decay  configuration  in  which  tt"  meson 
has  its  maximum  energy. 

(c)  A  configuration  in  which  three  pions  have  parallel  momenta,  i.e.  cos0  =  1 
cannot  result  from  t+  decay  if  its  spin-parity  relation  is  the  same  as  that  of  0+. 
The  total  parity  of  this  "linear"  configuration  is  necessarily  ^(—^1)*  and  hence 
the  probability  density  for  r"^  decay  events  must  vanish  for  a  "linear"  configuration 

ifT+=0+. 

These  three  situations  provide  an  essential  basis  to  distinguish  between  various 
spin-parity  possibilities  for  the  r  particle  and  also  between  t  and  6.  The  proba- 
bility for  T  decay  configurations  consists  essentially  of  two  factors, 

(a)  The  phase  space  density  q  given  by 

pq 


QdEj,d{GOsd)  = 


Mi  +  ml-2M,Ep 


dEj,d{GosO). 


(6) 


(b)  The  probability  density  99  which  is  the  square  of  the  amplitude  8{m)  for 
this  decay  configuration  averaged  over  r-spin  orientations,  i.e. 

1 


<P  = 


ITTT^^I^WP' 


(7) 


Plotting  the  observed  decay  events  with  t  =  E^  —  m^  and  cos0,  it  was  found 
that  the  events  are  uniformly  distributed  indicating  no  strong  dependence  either 
on  t  or  COS0  as  should  be  the  case  if  t"*"  =  0+.  Figs.  (32)  and  (33)  show  the  distri- 
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Fig.  32.  Distribution  of  t"*"  decay  events  in  cos  d. 


Fig.  33.  Distribution  of  t"*"  decay  events  as  a  function  of  n~  kinetic  energy  obtained 

from  892  observed  events;  the  hatched  histogram  refers  to  events  of  Baldo-Ceolin  et  al. 

only.  The  dotted  curve  shows  the  spin  i7+  distribution  of  Marshak  and  Sudarshan. 

The  dashed  curve  is  the  prediction  on  Schwinger's  model. 


butions  of  events  plotted  against  7t~  energies  and  cos0  separately.  From  angular 
momentum  considerations  alone  the  decay  of  r'^  meson  from  a  state  with  spin 
component  m  can  be  described  by 

C{Llj-  m^,  m^)  7f X (cos 0^)7r' (cos 61,), 


S{m)=-.ZdLi{V^q^)       Z 
L I  m£  4  mi  = 


(8) 


where  (7's  are  the  Wigner  coefficients  (see  Appendix  2).  The  di  are  scalar  coeffi- 
cients and  are  functions  of  p^  only. 

We  will  consider  whether  there  exists  any  spin-parity  possibility  for  which 
both  271  and  Sji  decay  are  permissible.  The  distributions  for  different  configurations 
for  the  various  spin-parity  assignments  have  been  worked  out  by  Dalitz^,  Fabri^ 
and  others.  From  Figs.  (32)  and  (33),  we  immediately  neglect  the  case  of  1" 
as  it  disagrees  with  the  data  in  all  respects.  Spin-parity  2+  represents  the  lowest 
possible  value  for  which  r  and  6  can  be  identical.  The  configurations  (0,  0),  (1,2) 
do  not  at  all  agree  with  experimental  data.  Marshak  and  Sudarshan^  have  in- 
vestigated the  effect  of  including  the  configuration  (3,  2).  Though  the  inclusion 
improves  the  agreement,  n~  energy  distribution  proves  to  be  a  difficulty.  To  get 


1  R.  H.Dalitz,  Phil.  Mag.  44,  1068  (1953);  Phys.  Rev.  94,  1046  (1954). 

2  E.  Fabri,  Nuovo  Cim.  11,  479  (1954). 

^  R.  E.  Marshak,  Proc.  Roch.  Conf.  on  High  Energy  Physics,  1956. 
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the  high  probability  observed  for  very  fast  yr-mesons,  it  will  be  necessary  to 
postulate  a  resonance  near  zero  energy  in  the  i)-wave  for  two  7r+ -mesons  and 
this  requires  a  n'^ — jr+ interaction  which  has  no  experimental  justification^. 
Still  there  are  a  large  number  of  slow  jr-mesons.  Higher  spins  will  slow  down  t+ 
decay  because  of  their  large  centrifugal  barrier  and  further  in  these  cases  we  have 
to  explain  the  absence  of  the  radiative  decay  of  r, 


/^^  +  y 


^71^  +  e^  +  e 


(9) 


which  will  be  just  forbidden  if  t's  are  spinless.  Teucher,  Thirring  and  Winzeler^ 
have  suggested  a  search  for  angular  correlations  in  the  spatial  orientations  of  t"*" 
decay  with  respect  to  the  production  plane  which  should  exist  "barring  accidents". 
For,  any  polarization  of  the  t+  particle  when  it  is  produced  will  be  reflected  in  the 
distribution  of  the  .q;--decay.  Orear  et  al.^  have  reported  no  significant  deviations 
from  isotropy  but  this  may  be  due  to  the  fact  that  the  production  process  do 
not  polarize  the  K  particle  as  they  are  spinless. 

To  sum  up,  the  statistical  data  on  r  decay  configurations  is  consistent  with 
spin-parity  assignment  0~  for  r,  0+  for  6.  Therefore  t  and  6  are  distinct  particles 
as  long  as  parity  conservation  is  assumed.  Dalitz*  has  also  pointed  out  the  sym- 


M^        ~~~  A^         p  N 

Fig.  34.  Triangular  plot  for  representation  of  t+  decay  events. 

metry  between  the  three  pions  irrespective  of  their  charge  in  the  t+  decay.  In 
Fig.  34,  XP,  XQ,  and  XE  are  the  perpendiculars  from  X  to  the  sides  of  an  equi- 

^  The  departure  of  the  T+-decay  spectrum  from  the  one  due  to  phase  space  amounting  to 
a  25%  variation  of  the  matrix  element  over  the  7c+  or  7i~  energy  has  been  attributed 
to  the  final  state  tz-ti  interaction.  See  the  Report  of  the  Tenth  Annual  Conference  on 
High  Energy  Physics,  Rochester,  pp.  518  et.  seq. 

2  M.  Teucher,  W.  Thirring  and  H.  Winzeler,  Nuovo  Cim.  1,  733  (1955). 

3  J.  Orear,  G.  Harris  and  O.  S.  Taylor,  Phys.  Rev.  102,  1676  (1956). 
'  R.  H.  Dalitz,  Proc.  Phys.  Soc.  A69,  527  (1956). 
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lateral  triangle  LMN  and  are  proportional  to  n~  and  jt"^  kinetic  energies  respec- 
tively. Momentum  conservation  confines  the  point  X  to  the  inscribed  semi-circle 
within  the  triangle.  Fabri^  has  shown  that  these  boundaries  are  considerably 
distorted  if  relativistic  effects  are  taken  into  account.  The  lines  OB  and  OC 
divide  this  allowed  region  into  three  equal  parts,  Y  and  Z  obtained  by  reflecting  X 
in  OC  and  then  Yin  OB,  which  represent  configurations  with  the  same  pion  energies 
but  a  different  distribution  of  charges.  There  is  good  experimental  evidence  for 
the  strong  symmetry  among  the  three  pions. 

Pais^  pointed  out  the  possibility  for  an  alternative  Stt  decay  called  the  t' 
mode,  i.e.  t^->:7i^+  tt®  +  n^.  The  maximum  energy  allowed  for  a  changed  tz 
meson  is  55  MeV  and  the  Q  value  is  86  MeV.  The  statistically  most  significant 

t' 
value  for  the  ratio  of  —  is  given  by 

—  =  0.332  ±0.067.  (10) 

The  6  —  T  puzzle  does  not  exist  if  the  postulate  of  parity  conservation  is  given 
up.  Before  this  far  reaching  development  took  place,  there  had  been  quite  a 
few  suggestions  to  solve  the  6  —  r  puzzle  still  assuming  that  parity  is  conserved 
in  weak  interactions. 

Lee  and  Orear^  attempted  to  explain  the  apparent  equality  of  lifetime  of  t 
and  6  as  due  to  a  cascade  process.  They  assumed  that  t  and  6  are  different  particles 
with  two  different  lifetimes  10~^  and  10"^^  sec.  Besides  the  various  decay  modes, 
they  suggested  that  t  decays  into  6  by  emitting  y-radiation,  i.e. 

T+->0+  +  y.  (11) 

If  we  assume  that  t  and  6  are  spinless,  i.e.  (0)  and  (0"*"),  then  we  require  emissions 
of  two  photons,  i.e. 

T+->0+  +  7  +  y.  (12) 

The  secondary  0+  will  give  rise  to  K„2  events.  As  the  lifetime  of  0+  is  smaller 
compared  to  that  of  t,  any  directly  produced  0+  will  decay  very  close  to  the  target 
and  would  not  contribute  to  the  K„2  events.  Thus  the  lifetime  of  the  iL^2  events 
would  necessarily  equal  the  lifetime  for  Stz  decay  since  the  observed  K„2  decays 
are  mostly  due  to  the  secondary  0"*".  It  is  also  clear  that  mass  measurements 
by  momentum -range  method  will  be  appropriate  only  in  the  case  of  the  t"*^  particle. 
In  order  to  give  a  radiative  lifetime  of  10~®  sec.  for  t"*"  and  to  have  the  correct 
branching  ratio  to  explain  K^^^  decay,  it  is  found  that  a  mass  difference 
{m^+  —  mQ+)  ^  lOnig  is  needed.  Further  the  search  for  the  y-ray  by  Alvarez* 
proved  futile.  This  proposal  is  now  given  up. 

1  E.  Fabri,  loc.  cit. 

2  A.  Pais,  Phys.  Rev.  86,  663  (1952). 

3  T.  D.  Lee  and  J.  Obear,  Phys.  Rev.  100,  932  (1955). 

*  L.  Alvarez,  Proc.  6th  Rochester  Conf.,  Interscience,  New  York  (1956). 
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Lee  and  Yang^  put  forward  a  proposal  to  explain  the  mass  degeneracy  of  t 
and  6  based  on  the  "parity-doublet"  hypothesis.  They  argued  that  just  as  there 
is  a  mass  degeneracy  between  proton  and  neutron  (i.e.  states  of  different  charges) 
there  can  exist  a  mass  degeneracy  between  states  of  different  parities.  If  Cp  is  the 
operator  that  switches  t  into  6  and  vice  versa  (i.e.  switches  the  parity)  Cp  is  called 
paritj^  conjugation  in  analogy  with  charge  conjugation. 

Cp\ry^  \ey;    Cp\ey=  |t>.  (13) 

The  approximate  mass  degeneracy  between  t  and  6  follows  if  we  postulate  that 
the  strong  part  of  the  Hamiltonian  commutes  with  Cp , 

i.e.     [Cp,H]  =  0.  (14) 

This  implies  that  under  strong  interactions  alone  m^=  niQ.  If  we  consider  a 
reaction  like 

71- -\- p-^A"" +  0'*  (15) 

and  operate  Cp  on  it,  then  it  follows  that  A^  must  be  a  parity  doublet  which  we 
shall  call  by  A^  and  Al .  Under  Cp 

TT- +  :p-^ylj  + 00  (16) 

becomes  .n        ^ 

71- +  p-^Al  +  T^.  (17) 

Thus  it  is  necessary  that  all  the  hyperons  should  be  parity  doublets.  We  have 
assumed  that  pions  and  nucleons  are  unaffected  by  O^ .  A  definite  prediction  of 
this  theory  is  that  for  7t-N  or  N~N  collisions,  the  production  cross-section  and 
angular  distribution  for  a  given  strange  particle  must  be  equal  to  those  for  the 
corresponding  parity  conjugate  particle.  Thus  it  follows  that  r/d  ratio  at  produc- 
tion is  necessarily  unity.  Treiman^  has  studied  the  effect  of  parity  doublet  theory 
in  hyperfragments  and  Lee,  Yang^  and  Gatto*  have  pointed  out  some  interesting 
interference  effects  whose  observation  would  indicate  the  existence  of  a  parity 
doublet.  As  such  there  is  no  direct  empirical  evidence  which  requires  the  existence 
of  hyper  on  parity  doublets. 

Xeutral  K  decay 

We  have  already  noted  that  the  existence  of  a  neutral  K^  particle  with  a  quite 
distinct  antiparticle  K^  has  some  interesting  consequences  which  were  first  studied 
by  Gell-Mann  and  Pais.^  In  strong  and  electromagnetic  interactions  owing  to 
conservation  of  strangeness,  K^  and  K^  cannot  be  connected  even  by  virtual 
processes.   However  the  situation  is  different  in  decay  interactions  in  which 


1  T.  D.  Lee  and  C.  N.  Yang,  Phys.  Rev.  102,  290  (1956). 

2  S.  B.  Treiman,  Phys.  Rev.  104,  1475  (1956). 

^  T.  D.  Lee  and  C.  N.  Yang,  Phys.  Rev.  104,  822  (1956). 
4  R.  Gatto,  Nuovo  Cim.  4,  197  (1956). 

^  M.  Gell-Mann  and  A.  Pais,  Phys.  Rev.  97,  1387  (1955).  In  this  paper  they  studied  this 
effect  assuming  charge  conjugation  invariance. 
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strangeness  is  not  a  good  quantum  number  and  is  not  conserved^.  We  instead  con- 
sider the  quantum  number  CP  as  we  expect  weak  int-eractions  to  remain  invariant 
under  CP.  We  can  form  suitable  linear  combinations  of  K^  and  R^  which  are 
eigen  states  of  CP.  Noting  that  since  K^  and  R^  are  antiparticles,  these  two  go 
into  each  other  under  CP,  if  we  define 

'Kt>  =  ^\Koy  +  ^\R'>y  (18) 

we  find  that  |iCj>  and  \K\y  are  eigen  states  of  CP  with  eigenvalues  +  1  and  —  1 
respectively. 

Since  CP  is  conserved  in  weak  interactions,  in  decay  processes  we  can  treat 
A' J  and  K^  as  basic  particles  with  their  own  modes  of  decay  and  consider  the  decaj^s 
of  K^  and  R^  as  a  consequence  of  their  being  linear  combinations  of  JlJ  and  Ko. 
We  note  the  important  feature,  that  the  lifetimes  of  XJ  and  K^  will  in  general 
be  diiferent  since  conservation  of  CP  during  decay  makes  available  certain  modes 
to  K^  which  are  forbidden  to  K^.  K^  cannot  decay  into  a  27i  system  with  CP  =  +  1 . 
The  state  [y){jLi~  +  r  +  Tf^)  — v(/^^  +  v  +  7i~)]  corresponding  to  CP  =  — 1  is 
a  possible  final  state  for  the  decay  of  K^  and  comparison  with  XJ3  suggests  that  K^ 
can  have  a  lifetime  exceeding  that  of  K^  by  a  factor  of  10^.  Thus  we  are  led  to 
predict  the  existence  of  two  neutral  particles,  one  short-lived  and  another  long- 
lived. 

Suppose  we  generate  a  beam  of  iT-mesons  by  the  reactions  71'  +  p  ->  A^  -{-  K^. 
We  note  from  (18)  and  (19)  that  half  of  them  are  iCj  and  the  other  half  K^ .  After 
a  lapse  of  time  '—'10"^  sec.  most  of  the  K^  would  have  decayed  into  27i  while 
very  few  of  the  K^  would  have  decayed.  The  "stale"  beam  consists  mostly  of  K^. 
Since  K^  is  a  "mixture"  state  of  strangeness,  the  "stale"  beam  when  allowed  to 

^  It  is  an  interesting  question  under  what  circumstances  we  can  consider  linear  combi- 
nation of  two  different  particle  fields  as  representing  the  field  of  a  new  particle;  more  so 
when  we  wish  to  have  a  linear  combination  of  particles  and  antiparticles.  In  discussing 
this  it  is  convenient  to  divide  'antiparticles'  into  three  classes: 

1.  Particles  and  antiparticles  are  not  identical  and  the  number  of  particles  minus  anti- 
particles is  conserved,  as  in  the  case  of  fermions  which  can  be  baryons  or  leptons. 

2.  Antiparticles  are  different  from  particles  but  the  conservation  law  is  not  valid  as  in  the 
case  of  bosons  K^,  K^,  7r+,  n~  and  iiC+,  K' . 

3.  Particle  and  antiparticle  are  identical  as  in  the  case  of  n^  and  y. 

Since  in  (2)  and  (3)  the  conservation  law  is  not  applicable,  particles  can  be  absorbed  or 
emitted  singly  while  in  the  case  of  fermions  the  interaction  has  to  be  bilinear.  Linear  combi- 
nations of  fermions  and  antifermions  are  meaningless  in  view  of  such  conservation.  In  (3)  it 
is  trivial  though  the  photon  can  of  course  be  treated  as  a  linear  combination  of  right  and 
left  circularly  polarized  photons.  Since  conservation  of  charge  is  absolute,  there  is  no  meaning 
in  speaking  of  combinations  of  the  charged  jr-mesons.  So  the  only  meaningful  combination 
is  for  the  K  particles  and  this  is  possible  since  the  decay  channels  of  K^  and  K^  are  identical, 
but  linear  combinations  could  be  so  chosen  that  the  channels  can  be  apportioned  in  a  mutually 
exclusive  manner. 
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iteract  with  matter  will  produce  interactions  characteristic  both  of  ^  ==  +  1 
^and  S  ^  — 1.  This  behaviour  of  the  'stale  beam"  is  to  be  contrasted  with  that 
of  the  "fresh  beam"  of  iC-mesons  all  of  which  had  strangeness  +  1. 

Since  in  weak  interactions  only  Zj  and  K^  are  meaningful,  it  is  necessary  to 
consider  K^  and  K^  as  linear  combinations  of  K^  and  K^ .  In  strong  interactions 
however,  K^  and  K^  retain  their  respective  identities.  This  is  quite  obvious  from 
the  fact  that  we  can  have  a  "fresh  beam"  of  K^  or  K^  at  the  time  of  production. 

We  give  here  the  decay  modes  of  K^  and  K^ . 

In  analogy  with  t^  ,  the  K^  and  K^  particles  which  decay  into  three  mesons 
are  also  called  tJ  and  t^  respectively.  But  the  6^  decay  will  be  similar  to  the  t^ 
decay.  The  3  Ji°  decay  of  r^  corresponds  to  CP  =  —  1  and  the  7i+  mode  to  CP  =  —  1 . 
If  we  denote  by  I  the  angular  momentum  of  7r+  smd  7z~  in  their  centre  of  mass 
system  and  L  the  angular  momentum  of  ji^  in  the  t^  rest  system  for  the  r^  particle 
(corresponding  to  K^)  we  have  I  =  L  =  0,2,  .  .  .  K^  will  rarely  decay  through 
this  mode  because  of  phase  space  considerations.  However  for  the  tJ  particle  (cor- 


Name 

Decay  mode 

Branching  ratio 
(per  cent) 

Lifetime 

K^ei) 

Kl->7l'-+7l- 

86.6 

(1.00  ±  0.038)  X  10-^9  sec 

Kuei) 

Kl->27i^ 

14.6 

T? 

/!ro->jr+  +  7r-+7r» 

very  rare 

TO 

30 

KliK,,) 

K^-^e^-\-  v  +  n+ 

30 

(6.lil:5)  X  10-8  sec 

KtiK^,,) 

K».-^IU^^V^7l  + 

30 

responding  to  Xj)  iy=Z=  1,  3,  .  .  .  are  allowed  and  hence  the  outgoing  pions 
face  centrifugal  barriers  owing  to  their  angular  momentum  even  in  the  lowest 
configuration.  Since  Stt^  has  CP  =  — 1,  only  K^  can  decay  into  this  mode  while  K^ 
cannot  undergo  decay  through  this.^  For  K^  even  the  Sn  mode  is  less  important 
because  of  phase  space  considerations.  The  existence  of  the  long  lived  K2  particle 
has  been  established  by  experiments. ^ 

We  will  discuss  now  two  interesting  phenomena  which  depend  upon  considering 
K^  and  K^  as  a  linear  combination  of  K^  and  K^.^  From  the  above  discussion  it 
is  clear  that  at  the  time  of  production,  i.e.  ^  =  0  the  wave  function  describing 
the  K^  particle  has  the  form 

y,(KO,  0)  =  ~[y, (K\ ,  0)  +  iv(K%,  0)] .  (20) 


^  See  A.  Pais,  Proc.  7th  Rochester  Conference,  Interscience,  New  York  (1957). 
2  K.  Lande,  E.  T.  Booth,  J.  Impeduglia,  L.  M.  Lederman  and  W.  Chixowski,  Phys. 
Rev.  102,  1901  (1956). 

=^  A.  Pais  and  O.  Piccioni,  Phys.  Rev.  100,  1487  (1955). 
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This  state  changes  with  time  due  to  the  phase  changes  arising  from  the  difference 
in  energy  of  K^  and  K^  and  the  difference  in  decay  lifetimes.  Thus  at  a  later 
time  /,  the  wave  function  becomes 

tp{£y,  t)  =  ~7^e''"^^[e-'l^^^y,{Kl  0)  +  ie^^^' e-h'^'y,{K%  0)],  (21) 

where  w^  =  ymf  +  p^;         A  =  CO2-  o)iC^  — (22) 

and  Ai  and  Ag  are  the  decay  constants  for  K^  and  K^  respectively.  From  the 
interpretation  of  GrcU-Mann^  we  know  that  after  a  flight  time  long  compared 
with  10-^®  sec  we  will  be  left  with  a  pure  K^  beam.  Since  K^  contains  the  K^ 
component  with  ^  =  —  1  which  was  absent  in  the  initial  K^  beam  there  is  a 
possibility  of  observing  hyperons  if  the  K^  beam,  at  distances  far  from  the  primary 
target,  is  allowed  to  interact  with  nuclei  and  this  will  provide  a  convincing  proof 
for  the  existence  of  K^ .  Supposing  this  secondary  target  is  thick  enough  to  remove 
a  considerable  amount  of  K^  from  ^2  >  *^®  ^"  which  interacts  little  with  the  nuclei 
will  pass  on  beyond  this  target  and  thus  there  is  a  regeneration  of  the  shorty 
lived  K^  far  from  the  previous  target.  We  can  then  observe  the  characteristic 
271  decay  of  K^  component  of  this  secondary  K^  beam.  We  shall  now  consider 
this  process  in  a  quantitative  way.  Denoting  the  wave  function  at  time  t  by 

=  K(0  V^(^?)  +  a,(t)y^{K^^)]e'^-\  (23) 


where 


a,W.^(f);         a.W^i'i^  (24, 


we  arrive  at  the  following  differential  equations  for  amplitudes  aj^(t)  and  a^it) 


•^">+i-^-T^h 


da^it)  ./I  1    \        ,      /.  .  1  1 


dt 


(25) 


(26) 


where  t^  and  t^  denote  the  mean  time  of  flight  for  K^  before  scattering  and  for  K^ 
before  nuclear  absorption  respectively,  t  =  l/Aj  being  the  mean  lifetime  for  K^ 
decay.  Initially  we  start  with  a  pure  K^  beam  and  do  not  take  into  account  nuclear 
absorption,  i.e.  after  a  lapse  of  time  of  the  order  10~^'*  sec  we  have  a  K^  beam. 
We  shall  use  the  above  equations  to  study  what  happens  to  this  K^  beam  of 


^  M.  Gell-Mann  and  A.  Pais,  loc.  cit. 
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velocity  v  when  passing  through  a  nuclear  matter  of  thickness  d  —  vt.  Thus  at 
/  -  0,  ai(0)  =  0,  a2(0)  =  A.  Then 


(hit)  =  —iXA 


fi2-f^i 


(27) 


where 

/*2,  /^i  =  H-  (0^  +  /^)  ±  1/{4P  +  (^  -  /?)2]  (29) 

The  absorption  curve  for  K^  is  given  by  the  ratio  of  the  final  intensity  to  the 
initial  one. 


I  id) 


(30) 


i(0)         |a2(0)|2 

Serber  ^  has  pointed  out  that  if  dm  is  large  relative  to  the  K^  lifetime  width,  nuclear 
absorptions  leading  to  hyperons  can  occur  within  the  material  rather  close  to  the 
primary  target.  In  this  case,  the  wave  function  oscillates  between  y){K^)  and 
ip{K^)  and  if  the  oscillatory  variation  of  these  reactions  with  distance  can  be 
measured,  this  should  allow  an  estimate  of  ^m  despite  its  smallness. 

It  is  not  yet  clear  whether  K^  can  decay  into  a  pion  and  two  leptons.  Assuming 
that  both  K^  and  Xg  can  have  such  a  decay  mode  with  similar  rates,  we  can  dis- 
cuss some  interesting  effects  due  to  interference  between  K^  and  K'^.^  It  is 
expected  that  these  decays  will  have  a  complicated  time  dependence  rather 
than  the  usual  exponential  one.  The  observation  of  such  effects  will  directly 
confirm  the  theoretical  prediction  of  K^  and  K^.  Considering  only  the  decay 
{jiev)  of  a  neutral  K  particle  for  K^,  the  final  state  is 

y){Kl)  ->  «i[t/^(e+  +  v  +  7i-)  +  y){e-  +  v-\-  rc+)]  (31) 

where  a^  denotes  the  decay  amplitude  for  the  process  while  for  K^ 

y,(Kl)  ->  a^[xp{e^  +  v  +  7i-)-xp{e-  +  v  +  tz^)].  (32) 

Starting  with  an  initial  K^  state  we  have  at  a  later  time  a  state  described  by  the 
wave  function  y){t),  the  quantity  \iK^  |  ^(0>  P  gi^i^ig  the  fraction  of  the  original 
K^  remaining  in  the  beam  after  time  t,  while  |<^'^  |^(0>  P  gives  the  fraction  of 
the  original  K^  turned  into  K^.  The  sum  of  these  fractions  keeps  decreasing  with 
increasing    t   as   the    mesons   decay.    The    rates   at   which    {e~  ^  v  +  n^)    and 

1  R.  Serber  and  A.  Pais,  Phys.  Rev.  09,  1551  (1955);  K.  M.  Case,  Phys.  Rev.  103,  1449 
(1956).  M.  L.  Good,  Phys.  Rev.  106,591  (1957).  The  most  convincing  experiments  concemmg 
the  existence  of  X^  are  the  observation  of  hyper  fragments  in  emulsion  stacks  located  near 
a  target  exposed  to  iC+ -meson  beam.  This  production  of  hyperfragments  must  be  obviously 
due  to  the  following  process : 

iC+  -^  K^  (charge  exchange  scattering),  followed  by  Pais-Piccioni  oscillations  producing  K^y 
giving  rise  to  hyperfragments. 

2  S.  B.  Treiman  and  R.  G.  Sachs,  Phys.  Rev.  103,  1545  (1956). 
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{e-^  +  V  +  7i~)  decays  occur  at  time  (  are  proportional  to  the  squares  of  the  ampli- 
tudes of  these  two  states  in  (31  and  32).  Thus  we  have- 

i?(e^  + v-j-:7r-)  =  j{afe-V4.a|e-V-f.2aia2COs(M)e-^^+VV.}  (33) 

Hie-  +  v  +  jT^)  =  ijaf  e--^!'  -f  afe'V  -  2aia2COs(M)  e-<^^V'/.}.  (34) 

It  is  to  be  not«d  that  these  expressions  contain  the  interference  terms  with  an 
exponentially  diminishing  factor  and  an  oscillatory  factor  depending  upon  the 
energj'  difiference  between  iTj  and  K^.  The  total  rate  of  decay  into  {nev)  states 
follows  the  exponential  law  corresponding  to  mixed  K^  and  K^,  i.e. 

(35) 


.2«-A,« 


jR(e-'  +  v-\-7i-)-{-  R{e-  +  V  +  7r+)  =  af  e"^**  +  a|e 

The  special  case  a^  =^  a^  is  of  interest,  for  then  K^  can  decay  only  to  e"*"  +  v  +  tt" 
and  K^  only  to  (e'  +  v  +  7r+).  If  we  start  with  a  pure  K^  beam,  then  near  the 
origin  i? (6"  +  v  +  7i^)  must  be  zero.  However  as  time  proceeds,  the  beam  acquires 
the  K^  component  and  the  strength  is  measured  by  the  appearance  of 
R{e~  +  V  -\-  n^).  The  time  dependence  of  the  electron  and  positron  decay  modes 

in  this  case  for  two  values  of  energy  difference  A  =  — ^ are  shown  in  the  figure. 
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Fig.  35.  The  decay  rate  of  theiT "-particle  into  the  modes  e+  vn"  and  e~  vn'^  as  a  function 
of  time  under  assumptions  a^  =  a^^  for  two  values  of  the  energy  difference.  It  is  assumed 

that  ^2  <  Aj . 


Exactly    similar    effects    can    also    occur    when    K^    decays    into    final    states 

(fjL^  -f  r  +  TT  )  and  [fi~  +  V  +  rTT"*").  A  time  dependence  of -^  or ratio  in  the 

decay  of  neutral  beam  can  be  absent  if  this  ratio  equals  unity,  i.e.  a^  =  0  or  ag  =  0. 
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It  might  also  be  due  to  the  energy  difference  A  being  so  large  that  the  oscillatory 
terms  do  not  give  observable  effects. 

In  the  above  discussion,  we  have  assumed  that  CP  in  variance  is  obtained 
which  also  implies  through  the  CPT  theorem  that  the  interactions  are  invariant 
under  time  reversal.  Lee,  Yang  and  Oehme^  have  discussed  the  situation  when 
time  reversal  in  variance  fails.  They  show  that  we  can  still  obtain  K^  and  K^ 
(which  are  mixtures  oiK^  and  K^  states)  with  different  lifetimes.  They  also  point 
out  that  a  charge  asymmetry  between  {e'^  +  7i~  +  v)  and  {e~  +  Tf^  +  v)  modes 
is  possible  for  the  K^  beam. 

1^1  /|  =  J  rule 

A  phenomenological  rule  was  proposed  to  explain  the  lifetime  and  branching 
ratio  of  the  strange  particle  decays  by  postulating  that  the  decay  interaction 
Hamiltonian  transforms  like  a  spinor  of  rank  ^/g  in  the  isotopic  spin  space.^ 
Consequently  the  isotopic  spin  of  the  initial  and  the  final  state  will  differ  by  ^/g ,. 

\If-Ii\  =  \Al\=^i.  (36) 

This  rule  is  termed  the  |  J  / 1  =  i/g  rule  and  can  also  be  interpreted  as  follows.^'*  In 
the  decay  of  a  particle  an  additional  particle  called  the  "spurion"  is  emitted  or  ab- 
sorbed that  carries  no  energy  or  momentum  or  charge  but  does  carry  an  isotopic  spin 
1/2  and  whatever  the  I^  component  is,  that  is  needed  to  conserve  I^ .  With  the  inclu- 
sion of  the  spurion,  the  ' '  violation  rule"  |  Zl  /  ]  =  ^/g  can  be  dropped  and  we  are  allowed 
to  assume  that  /  is  conserved  in  the  decays.  This  rule  has  quite  an  amount  of  success 
in  explaining  the  branching  ratios  between  different  modes  of  decays  of  hyperons 
as  well. 

Let  us  discuss  the  utility  of  this  rule  regarding  K^  and  K^  decays.  Application 
of  rule  I  Zl  / 1  =  ^  to  ^  decay  requires  that  the  final  state  must  have  /  =  0  or  7  =  1 
since  I{K)  =  J.  When  a  spinless  particle  decays  into  two  tt  mesons  (i.e.  6  mode), 
the  two  pions  are  in  the  S  state.  Because  jt's  are  bosons,  their  complete  wave 
function  must  be  symmetric.  We  know  that  the  relative  orbital  motion  is  also 
symmetric  and  thus  their  isotopic  spin  wave  function  must  be  symmetric.  Thus 
the  final  state  having  /  =  1  is  excluded.  However  K^  cannot  decay  into  7=0 


1  T.  D.  Lee,  C.  N.  Yang  and  R.  Oehme,  Phys.  Rev.  105,  1671  (1957). 
-  M.  Gell-Mann  and  A.  Pais,  Proc.  Int.  Conference  on  High  Energy  Physics,  Pergamon, 
London  (1955). 

*  Due  to  G.  Wentzel,  see  p.  24.  M.  Gell-Mann  and  A.  Rosenfeld,  loc.  cit. 

*  It  is  shown  by  Salam  and  Ward  (Phys.  Rev.  Lett.,  5,  390  (I960))  that  conventional 
field  theory  contains  a  natural  realization  of  "spurions".  The  vacuum  expectation  value 
<Kj)q  is  non-zero  and  is  proportional  to  /«,  where  f^K^n^TT  is  the  weak  interaction  Hamil- 
tonian causing  the  decay  K^-^  n^  +  n".  Thus  even  the  isotopic  spin  conserving  T  product 
of  operators  such  as  AN^ttiK  has  a  non-zero  matrix  element  for  the  decay  A-^  N  -\-  tc 
through  ylA^+r;r<Z>o.  Thus  <Z^>o  gives  a  realization  for  the  spurion.  Parity  non- conservation 
is  explained  by  postulating  a  X'-meson  of  opposite  (strong)  parity  to  K  with  <-K^'>o  =^  0. 

16* 
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siiice  their  final  state  has  /^  -  ±  1  while  K^  can  decay  into  7  =  0  state.  K^  decay 
can  occur  only  through  the  violation  of  the  rule  \Al\  —  ^f^.  Perhaps  electro- 
magnetic corrections  to  the  rule  are  responsible  for  the  decay.  At  any  rate  the 
rule  explains  the  rapid  decay  of  K^  into  2n  state  as  compared  with  K^. 

We  obtain  as  a  second  result  that  the  ratio  of  K^  decay  into  two  modes 

-^n^  +  n-  (37) 


should  be  ^/j  i.e. 


,      Kl-^'^^      .,  (38) 


At  present  there  are  these  experimental  values  for  the  ratio,  viz.  0.30  ±  0.06 
and  0.14  ±  0.06.^  We  have  already  noted  that  the  ratio  of  —  =  0.332  ±  0.067. 

For  a  pseudoscalar  t,  S  wave  emission  of  three  pions  is  possible  and  since  pions 
are  bosons,  the  final  isotopic  spin  state  must  be  symmetrical.  Let  us  try  to  predict 
the  ratio  by  the  |zl  / 1  =  i/g  rule.  Since  the  final  state  carries  a  unit  charge  /^  =  +1, 
/  ==  1  is  the  only  possible  final  state.  Then  the  ratio  of  squares  of  the  matrix 
elements  \  M {jc^  +  27if*)  \^  :  j if  (tt"  +  2:7r+)  p  is  determined  to  be  1 :  4.  Remember- 
ing however  that  the  ratio  of  Q  values  for  the  two  processes  is  84  :  75  the  phase 
space  ratio  becomes  1.3: 1.  Thus  the  branching  ratio  should  be  1.3:4.325,  a  result 
quite  consistent  with  the  experimental  ratio.  If  we  now  consider  "t"  decays  of 
neutral  JT-mesons,  we  note  that  there  are  two  possible  charge  states  Stz^  and 
71"^  +  7i~  -\-  7if^.  Assuming  CP  invariance  Stz^  has  CP  =^  — 1  and  hence  is  allowed 
for  K^  but  is  forbidden  for  K^ .  Of  course  we  have  assumed  that  K  is  spinless.  Without 
a  knowledge  of  the  wave  function  we  cannot  decide  CP  parity  oi  jc^  -^  7i~  +  ttP 
state.  Experimentally  the  fraction  of  decays  into  Sn  is  quite  appreciable.  Assuming 
that  the  final  Sti  state  is  totally  symmetric,  Gell-Mann^  finds  that  |zl/|  =  1/2 
rule  requires  that  the  total  rate  of  K^^Sn  should  equal  the  total  rate  of  K+  ->  Sn 
while  the  ratio  of 


Kl->7t^  +  71-  +  77P       2 


From  the  above  application  of  |^  / 1  ==  ^U  rule  to  K+  decay,  we  find  it  fails 
to  explain  correctly  the27r  mode  of^^.  Gell-Mann^has  put  forth  some  proposals 
assuming  the  |zl/|  =  ^/g  rule  to  be  approximately  valid.  To  obtain  a  non-zero 
decay  rate  for  R-^  -^  ti'^  -{-  Jif^  he  proposes  that  contributions  from  |^  / 1  =  ^/g 
and  possibly  from  |^  / 1  =  V2  should  be  taken  into  account.  Denoting  the  complex 


^  See  "Strange  Particle  Decays",  Report  by  D.  A.  Glaser,  International  Conference  on 
High  Energy  Physics  at  Kiev,  1959,  and  Crawford  et  al.,  Phys.  Rev.  Letters  2,  266  (1959). 

2  M.  Gell-Mann  and  A.  H.  Rosenfeld,  loc.  cit. 

3  M.  Gell-Mann.  Nmyvo  Cim.  5,  76  (1957). 
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amplitudes  by  £3  and  £5  respectively  relative  to  that  for  ]zl  / 1  =  1/2,  the  ratio 
of  271  decay  mode  for  K^  and  K^  and  the  ratio  /  becomes 


X^-v27r 


K^-^2n        3    1+  1^3+ £5 


13/     c    _p    |2 

"'"'    ^^'    =r.  (40) 


1    \l-P{e,  +  e,)f 

On  setting  y  =^  ^/^oo  as  indicated  by  experiment,  Gell-Mann  finds  that  /  =  0.14 
can  be  made  to  fit  with  £3  ^  12  per  cent;  £5  ~  11  per  cent  or  with  £3^6  per  cent 
and  £5  :^  17  per  cent.  It  seems  improbable  that  electro-magnetic  effects  can  give 
rise  to  such  large  corrections.  Thus  we  can  conclude  that  lZl/|  =  ^/g  is  only  ap- 
proximately valid  even  for  weak  interactions  and  we  can  expect  violations  to  the 
order  of  ten  per  cent.  It  is  not  clear  whether  the  \Al\  =  ^/g  rule  can  be  extended 
to  other  decay  modes  involving  leptons,  of  course  assuming  the  leptons  to  possess 
zero  isotopic  spin.  Sudarshan  et  al.^  have  suggested  that  the  strangeness  violating 
non-leptonic  current  should  transform  like  an  isospinor.  We  will  discuss  the  con- 
sequences of  this  later. 

^  E.  C.  G.  SuDARSHAX,   S.  Okubo,  R.  E.  Marshak   and  S.  Weinberg,   Phys.   Rev,   112, 
665  (1958). 


CHAPTER  VII 

WEAK  INTERACTIONS 

1.  WEAK  INTERACTIONS^ 

We  remakked  earlier  that  the  t  —  0  puzzle  does  not  exist  if  we  accept  the 
violation  of  parity  conservation  in  decay  interactions.  Lee  and  Yang^  first  made 
this  suggestion  and  proposed  a  large  number  of  experiments  which  can  test  the 
hypothesis  of  non-conservation  of  parity  in  decays.  As  we  shall  see  presently 
experiments  have  overwhelmingly  supported  this  hypothesis.  Since  from  the 
celebrated  theorem  of  Pauli  and  Luders^  any  interaction  should  be  invariant 
under  the  combined  operations  of  parity  (P),  charge  conjugation  {C)  and  time 
reversal  (T),  and  as  parity  is  not  conserved,  at  least  one  of  the  operations  C  and  T 
should  be  violated.  It  is  believed  from  theoretical  arguments  and  experimental 
evidence  that  C  is  violated  along  with  P. 

Weak  interactions  responsible  for  decays  can  be  broadly  divided  into  two 
classes : 

(a)  Decays  involving  leptons, 

(b)  Decays  not  involving  leptons.  i.e.  the  particle  decays  only  into  strongly 
interacting  particles.  Such  a  classification  is  made  since  as  we  shall  see  later  it 
is  possible  to  explain  features  like  parity  non-conservation  in  the  leptonic  decays 
by  the  two- component  theory  of  the  neutrino,  an  explanation  which  cannot 
be  appUed  to  the  non-leptonic  decays.  For  the  latter  we  further  seem  to  have  a 
selection  rule  |zJ  / 1  =  ^/g .  Despite  these  seemingly  different  characteristics,  the 
coupling  constants  for  these  two  classes  are  remarkably  identical.  Of  course,  we 
do  not  know  how  to  calculate  the  various  decays  since  we  do  not  have  a  complete 
knowledge  of  how  the  various  fields  are  coupled  though,  as  we  shall  see  presently, 
recent  developments  indicate  a  universal  form  of  interaction.  Even  in  the  absence 
of  a  knowledge  of  detailed  interactions,  we  can  estimate  the  coupling  constant 
by  the  simple  formula 

r  =  271 0'l^'^eE  (1) 

^  Part  of  this  chapter  closely  follows  the  discussion  on  Weak  Interactions  in  T.  D.  Lee 
and  C.  N.  Yang's  "Elementary  Particles  and  Weak  Interactions".  —  Lectures  given  at  the 
Brookhaven  National  Laboratory,  U.S.A.  (1957). 

2  T.  D.  Lee  and  C.  N.  Yang,  Phys.  Rev.  104,  254  (1956). 

^  W.  Pauli's  particle  in  Niels  Bohr  and  the  Development  of  Physics,  Pergamon,  London 
1955;  see  also  G.  Luders,  Kgl.  Danske  Vidensk.  28,  No.  5  (1954). 
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where  t  is  the  lifetime  of  the  particle,  q^  is  the  number  of  final  states  per  unit 
energy,  and  R  the  characteristic  length  of  these  decays  which  can  be  crudely 
estimated  from  the  unbalance  in  energy  by  means  of  the  uncertainty  principle. 

We  have  listed  below  the  estimates  of  G"^  for  a  number  of  decays  belonging  to 

h 
both  classes.  For  simplicity  we  take  R 


m„c 


Decays 

Lifetime  (sec) 

G^xlOi* 

A^'-^p  +  n- 

3      X 10-10 

1.6 

E~^n  +  n- 

1.6    X  10-10 

1.2 

Kl^2n 

0.95x10-10 

2.8 

7l±^fJl±  +  V 

2.6    xlO-8 

0.3 

^±  ->  e±  +  V  4-  r 

2.2    xlO-« 

2.0 

K±  -^  fx±  +  V 

1.2    xlO-8 

0.02 

Despite  the  wide  variation  in  lifetimes  of  the  particles,  the  coupling  constants 
G^  seem  to  be  identical.  This  remarkable  feature  supports  the  surmise 
of  a  common  origin  for  all  weak  interactions. 

Decays  involving  leptons  can  be  further  classified  as  follows: 
(i)  Those  involving  only  leptons,  e.g.  jj,^  ->  e^  +  v  +  v. 

(ii)  those  which  involve  no  change  of  strangeness  AS  =  0,  e.g.  n -> p  +  c  +  v, 
and  (iii)  Those  involving  change  of  strangeness  Zl^  =  1,  e.g.  A->  p  +  e    -\-v. 

The  non-leptonic  decays  like  A->  p  -{-7i~  always  involve  change  of  strangeness. 
All  the  weak  interactions  in  the  current-current  picture  can  be  generated  from 
the  four  typical  interactions  by  the  introduction  of  a  strong  vertex,  e.g.,  tt  gets 
replaced  by  iV^^,  K  by  AN  etc.,  and  also  by  replacing  an  electron  by  a  ^-meson. 

Before  discussing  the  theory  of  weak  interactions  we  shall  first  outline  in  a 
general  way  how  observational  quantities  can  reveal  in  variance  or  non-in  variance 
under  symmetry  operations  like  P  and  T,  and  apply  such  considerations  to  parti- 
cular cases  like  the  ^  decay. 


Tests  for  inyariance  under  C,  P,  T 

(a)  Parity :  If  the  state  of  a  quantum  mechanical  system  is  not  invariant  under 
parity,  it  implies  that  the  eigenstate  of  the  Hamiltonian  is  not  an  eigenstate 
of  the  parity  operator  (i.e.  a  state  ^  is  a  linear  combination  ay)+  +  bip^  where  y)^. 
and  y)_  are  even  and  odd  parity  eigenstates.  Let  a  quantum  mechanical  system 
A  decay  to  a  stated.  Parity  non- conservation  implies  that  if^  is  an  eigenstate 
of  parity,  B  is  not.  In  the  double  scattering  of  protons^  a  beam  of  incoming  pro- 


1  O.  Chamberlain,  E.  SbgrS,  R.  Tripp,  C.  Wiegand  and  T.  Ypsilantis,  Phys.  Rev. 
1430(1954). 
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tons  with  momentum  p^  is  scattered  by  a  target  into  a  state  with  momentum 
P2  J  which  is  further  scattered  by  a  second  target  into  momentum  p^ .  The  cross- 
section  for  double  scattering  should  be  independent  of  |>3  •  (p^  x  p^)  which  is  a  pseu- 
doscalar  (see  below).  The  measurement  reveals  no  dependence  on  P3  •  (Pj  x  Pg)  and 
this  supports  the  conclusion  that  parity  is  conserved  in  strong  interactions.  Ex- 
perimental evidence  indicates  that  parity  is  conserved  by  nuclear  forces  down  to 
one  part  in  10'  in  intensity.^  This  is  based  on  the  study  of  the  yield  of  the  highest- 
energy  group  of  oc  particles  in  the  reaction 

The  spin  and  parity  of  O^*  and  a-particles  are  both  0+.  The  reaction  can  have 
a  resonance  corresponding  to  a  known  1+  state  of  Ne^**  if  parity  is  not  conserved. 
No  such  resonance  was  found.  The  conservation  of  parity  in  electromagnetic 
interactions  is  established  experimentally  to  about  one  part  in  10®  in  intensity 
by  the  validity  of  selection  rules  in  atomic  transitions. 

In  any  decay  process  we  will  have  particles  emerging  in  various  directions 
with  different  energies  and  we  normally  measure  the  distribution  in  intensity, 
i.e.  the  number  of  particles  emerging  at  a  given  angle.  This  distribution  function 
can  depend  on  many  quantities  that  can  be  formed  from  the  momenta  and  spins 
of  the  interacting  particles.  However  the  sjmimetry  properties  of  the  interaction 
responsible  for  the  decay  in  question  will  limit  the  dependence  of  the  distribution 
function  only  to  certain  of  these.  If,  for  example,  an  interaction  is  invariant  under 
rotation,  then  it  is  clear  that  the  distribution  function  should  contain  only  quan- 
tities which  are  also  invariant  under  rotation.  In  a  similar  manner  the  distribution 
function  can  depend  on  pseudoscalar  quantities  only  when  the  interaction  is  not 
invariant  under  P.  It  is  convenient  to  discuss  the  P  decay  interaction  since  it  is 
one  of  those  weak  interactions  which  has  be^n  already  studied  experimentally 
and  theoretically  though  not  with  the  object  of  testing  the  postulate  of  parity 
conservation. 

Admitting  non- conservation  of  parity  the  most  general  interaction  Hamiltonian 
for  p  decay  is 

^iut  =  Z  {%  Oi  Wn)  m  %  Oi  W.)  +  (C;  We  Oi  75  Wv)]  ,  (2) 

i 

where  i  =  8,T,  V,  A,  P  denotes  the  well  known  bilinear  co variants  of  the  Dirac 
spinors,  i.e.  scalar,  tensor,  vector,  axial  vector  and  pseudoscalar,  G^  and  CI  being 
the  ten  complex  coupling  constants.  The  second  term  is  pseudoscalar  while  the 
first  term  is  the  **old'*  interaction  Hamiltonian  and  is  scalar.  The  square  of  any 
typical  matrix  element  has  the  form 

\Mf={2]fiiCrCj-hc-c)  +  {y:fi-Cl*C;  +  c-c)  +  {29iiCrG;  +  C'C),       (3) 


1  N.  Tanner,  Phys.  Rev.  107,  1203  (1957).  Also  D.  H.Wilkinson,  Phya.  Rev.  109,  1603, 
1610,  1614  (1958);  R.  Haas,  L.  B.  Leipuner  and  R.  K.Adair,  Phys.  Rev.  116,  1221  (1959). 
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where  f^j,  flj  and  g^j  are  functions  of  the  momenta  and  spins  of  the  particles. 
It  is  important  to  note  that  any  transformation  which  leaves  the  Hi^^  invariant 
will  leave  (3)  invariant.  As  has  been  shown  earlier  both  ip^  and  75  xp^  are  solutions 
of  the  Dirac  equation  for  the  neutrino. 

(a)  Let  us  make  the  transformation 

VV  ->  75  Wv ;  Ci  ->  CI     and     q  ->  Ci .  (4) 

This  leaves  Hij^^  invariant.  Hence  it  follows  that 

h  =  fh'  (5) 

(b)  Ci^C^;  C'l^—Cl  with  space  inversion.  This  transformation  also  leaves 
^ij^t  invariant.  Note  that  the  minus  sign  acquired  by  the  second  term  in  the  Hamil- 
tonian  under  P  is  compensated  by  the  minus  sign  of  CI .  Since  under  P  we  have 
r  ->  —  r,  p-^  — p  and  spin  s  ->  s,  we  can  deduce  that 

hi(-p,  s...)=fij{+p,  s...)  (6) 

/^(-P,  s...)  =  /^(-hl>,  s...)  (7) 

and 

gij{-p,  s...)  =  -gr.^-(+p,s...).  (8) 

It  has  thus  been  shown  that  f^j  and  f^f  are  scalars  while  the  g^/s  are  pseudo- 
scalars.  Since  only  the  term  containing  g^j  involves  the  interference  of  both  the 
scalar  and  pseudoscalar  parts  of  -H^i„t,  in  order  to  test  the  question  of  parity 
conservation  we  must  measure  a  pseudoscalar  quantity.  Thus  if  in  the  P  decay 
from  oriented  nuclei,  the  angular  distribution  of  the  ^  particles  depends  on  the  pseudo- 
scalar  j  •  |>g  where  j  is  the  polarization  of  the  oriented  nuclei  andpg  the  momentum 
of  the  p  particle,  it  implies  an  asymmetry  of  this  distribution  about  the  direction 
of  the  spin  of  the  oriented  nuclei  and  this  asjmametry  can  be  detected  by  experi- 
ment and  will  constitute  a  proof  for  the  violation  of  parity  conservation. ^  It  can 
be  noted  here  that  the  measurement  of  any  scalar  quantity  in  the  new  ^^^t  i^ 
equivalent  to  working  with  the  "old"  parity- conserving  H^^^  provided  we  replace 
CfCj  in  the  old  formulae  by  {CfCj  +  Cl*Cj)  which  essentially  means  only  a 
redefinition  of  the  constants  C^ . 

Recalling  the  CPT  theorem  discussed  earlier,  we  deduce  that  if  weak  inter- 
actions fail  to  be  invariant  under  P,  then  it  implies  that  the  interaction  must 
violate  at  least  one  of  the  other  in  variances  under  C  or  T.  It  might  be  argued 
that  equality  of  masses  and  lifetimes  in  the  decays  of  charge -conjugate  particles 
like  71^,  jLi^  have  already  established  that  charge -conjugation  in  variance  is  valid 
in  weak  decays.  However  Lee,  Yang  and  Oehme^  pointed  out  that  the  equality 
of  masses  and  lifetimes  for  a  particle  and  its  anti-particle  follows  directly  from 
proper  Lorentz  in  variance  and  the  CPT  theorem  whether  separate  conservation 


1  T.  D.  Lee  and  C.  N.  Yang,  Phys.  Rev.  104,  254  (1956). 

2  T.  D.  Lee,  R.  Oehme  and  C.  N.  Yang,  Phys.  Rev.  106,  340  (1957). 


(10) 


250  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

of  C,  P  and  T  is  obtained  or  not.  This  theorem  is  true  only  in  the  absence  of  any 
final  state  interactions.^*^ 

(b)  Time  reversal:  We  shall  now  discuss  the  consequences  of  invariance  or 
non-invariance  under  time  reversal.  Let  i^^  be  the  weak-interaction  Hamiltonian 
which  causes  the  decay  A-^  B.  Writing  Hg,  in  the  form  2]  0(11^  such  that  under  T 

we  have  T H^T'^  =  Z Cf  Hf     and     T HiT'^  =  H^,  (9) 

we  note  that  if  the  C/s  are  real  or  at  least  all  the  C/s  have  the  same  phase 
then  time  reversal  invariance  holds.  Thus  to  prove  non-invariance  xmder  time 
reversal  we  must  measure  the  dependence  of  the  distribution  function  on  quan- 
tities whose  mean  values  depend  on  the  relative  phases  between  these  O/s. 
For  simplicity  we  shall  assume  that  there  exist  no  final-state  strong  interactions 
between  the  decay  products.  If  free  particle  states  of  momenta  Pi,p2  -  ■  -  and 
spin  Si,  §2  ^^^  denoted  by  |p,  s>,  then  the  final  state  B  can  be  written  as 

p, « 

=-I]CiM,{p,  s)\p,  s>, 
t 

^^^'^  Mi{p,s)  =  <p,s\Hi\Ay.  (11) 

We  know  that  m\        ^       ^  \  /,^v 

T\p,  s>  =  <-!>,  -s\  (12) 

and  hence  ,^^.  ^  hm-  ,  x  /,ov 
Mfip,  s)  =  Mi(-p,  -s).                                       (13) 

^  The  proofs  of  these  statements  will  now  be  indicated  (see  G.  Luders  and  B.  Zumino, 
Phys.  Rev.  106,  385  (1957).  T.  D.  Lee  and  C.  N.  Yang,  Phys.  Rev.  105,  1671  (1957)).  That 
masses  of  particles  and  antiparticles  are  equal  easily  follows  on  noting  that  CPT  is  a  unitary 
transformation  and  hence  the  eigenvalues,  in  particular  mass,  will  not  be  altered  under  such 
a  transformation  provided  the  HamUtonian  is  left  invariant  under  CPT.  We  recall  that 
particle  and  antiparticle  states  go  over  into  each  other  under  CPT. 

To  prove  the  life  time  equality,  consider  the  decay  A-^  B  and  A  -^  B.  Let  us  put  the  weak 
interaction  part  Hod  &8 

H^  =  H^  +  H_  (!) 

with  H^  =  K^c  ±  PB^P-']  (2) 

where  PH^P~^  =  ±  H^.  Also  let  us  denote  the  final  state  as  15>  =  |5^>  +  \B_y.  Applying 
CPT  theorem  and  using  rotational  invariance  we  can  show 

<:B^\H^\Ay*  =  +<B^\H^\Ay     ■  (3) 

and  _ 

<5_|^_|^>*  =  -<^_1^_|^>  (4) 

Since  the  two  matrix  elements  differ  in  phase  by  Tr/g,  there  can  be  no  interference  between 
the  two  matrix  elements.  Thus  the  lifetimes  t^  and  t^  are 

T^ocZKB^\H^\Ay\^  +  K^-I^-I^>l'  (5) 

r20ci:\<^B-\H-\Ay\^  +  \<B_\HSAy\K  (6) 

(Thus  T^  =  T2 .  However  if  final  state  interaction  is  present,  then  (3)  and  (4)  are  no  longer  true 
cf.  our  discussion  of  in  variances  with  final  state  interaction.  Hence  the  lifetimes  of  particles 
and  antiparticles  are  not  identical). 

2  See  S.  Okubo,  Phys.  Rev.  109,  984  (1958). 
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The  mean  value  of  any  observable  0  in  the  final  state  will  be  given  by 

<v|0|v>=2;'CfC,-0y,  (14) 

wh  PTP 

Oij=2]^Mr{p,s)Mj{p,s)<p,s\0\p,sy.  (15) 

We  note  that  Of,.  =  0,i.  (16) 

If  we  divide  the  observables  into  even  and  odd  functions,  i.e.  0+  and  0_ ,  then 
under  T  we  have 

<p,  s\0^\p,  s>  =  ±<-l>,  -s|Oi|-i>,  -s>,  (17) 

where  TO^T-^=^  ±0^. 

From  (16)  and  (17),  we  deduce  that 

(0+).-,-  =  {0^)h  =  real  (18) 

and  (^-)ij  =  — {^-)ji  =  imaginary.  (19) 

Thus  we  see  that  the  mean  values  of  Oj_  are  related  to  the  real  and  imaginary 
parts  of  products  of  C*  Cj . 

<v>\^.\y>y  =  S(^.)ii{CfCi  +  c,cf)  (20) 

<vlo-iv>  =  i;(o.)i,(Cfc,.-CiC;).  (2i) 

Remembering  that  the  imaginary  parts  of  Cf  Cj  exist  only  when  they  have  relative 
phases,  we  find  that  the  measurement  of  a  dependence  on  0_  will  constitute  a 
proof  for  non-invariance  under  time  reversal  in  the  absence  of  final  state  inter- 
actions. 

The  assumption  of  the  absence  of  final  state  interaction  is  only  an  ideal  one, 
for  there  always  exists  some  form  of  final  state  interaction  in  the  decays  such  as 
Coulomb  interaction  in  the  ^  decay  between  the  /?  particles  and  the  recoil  nucleus. 
The  free  particle  solutions  will  no  longer  be  eigensolutions  of  the  strong  part 
of  the  Hamiltonian.  It  is  well-known  that  the  exact  solution  will  consist  asymptoti- 
cally of  a  plane  wave  and  incoming  spherical  waves.  Under  time  reversal  this 
solution  will  correspond  to  a  plane  wave  of  opposite  momentum  plus  outgoing 
spherical  waves.  We  also  know  from  elementary  scattering  theory  in  quantum 
mechanics  that  the  asymptotic  solution  differs  from  the  plane  wave  by  a  phase 
shift  depending  on  the  interactions.  Let  us  denote  the  eigenstate  of  if  strong  by  |-B>. 
The  final  state  xp  will  be 

y=   i;<p,  s|B»"*><-B°'>'|H„|4>ip,  s>.  (22) 

B,  p,  s 

The  state  |5°^*>  is  the  outgoing  part  of  the  exact  solution  \By.  Let  us  denote 
the  phase  factor  of  l^®*^*)  by  e"*"'^  where  r]^  is  the  phase  shift  due  to  the  final 
state  interactions.  Then  we  can  write  |^>  as 

\W>=i;Gi^''^^iB{P,  s)\p,  sy,  (23) 

"^^^"^  Mf^{p,s)^M,s{-p,-s).  (24) 
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Proceeding  as  before,  we  find  that 

+  t(CfC,.-C<Cf)sin(ry^-^^,)] 
<V|0-|  V>  =  i;{0,)iB.  iBiiCr  Ci  -  (7,  Cf )  cosirjB  -  rjB') 
+  HCrCi  +  CiCf)Bm{rjB-r]B')]' 


(25) 


(26) 


The  important  modification  due  to  the  presence  of  final  state  interactions  is  that 
both  0+  and  0_  depend  on  the  imaginary  parts  of  CfCj  and  hence  measurements 
of  a  dependence  on  either  0+  or  0_  will  serve  as  proof  for  the  violation  of  time  re- 
versal invariance.  We  shall  conclude  this  general  discussion  with  the  following 
observations. 


Let  us  write  an  interaction  Hamiltonian  Hto  as 


(27) 


where  H^  and  H^  commute  and  anticommute  with  P  respectively,  i.e.  H^  con- 
serves parity  while  H^  does  not.  Let  us  also  write 

Hr=i:{Or),Hi    and    H^=  2(C,)iHi-  (28) 

If  time  reversal  holds  for  both,  then  (C^i  and  (C^i  must  all  be  real.  From  CPT 
theorem  it  follows  that  for  charge  conjugation  symmetry  to  hold,  all  (C^)^  must  be 
real  while  all  (C^i  must  be  imaginary.  Thus  (O^)^  and  (Cg)^  differ  in  phase  by  nj^ 
and  hence  there  can  be  no  interference  between  the  two  parts  of  the  Hamiltonian. 

This  proves  the  very  important  result  that  if  time  reversal  invariance  holds  for 
an  intera^ion  then  the  effects  due  to  parity  violation  can  be  observed  only  if  the  charge 
conjugation  invariance  is  equally  violated.^  In  the  table  below  are  presented 
results  which  follow  from  the  above  discussion. 


Observable 

Example 

Mean  value 
involves 

Violates 

invariance 

of 

Invariant 
under 

0+  Scalar 

Pi   Pi 

ReCfC,. 

_ 

P,C,  T 

0+  Scalar 

(Pe'Pv) 

Im  C*Cf 

C,  T 

P 

0_  Scalar 

O'iPlXPi) 

Im  C*  Cf 

C,  T 

P 

0_  Scalar 

o-iPixpi) 

Re  Cf  Cf 

— 

P,C,  T 

0+  Pseudoscalar 

(cr.p) 

ReC*Cf 

P,  C 

T 

0+  Pseudoscalar 

ia-p) 

Im  C*Cf 

P,  T 

C 

0_  Pseudoscalar 

{Pl'{P2XPz)) 

ImCfCf 

P,  T 

C 

0_  Pseudoscalar 

(Pi-ip^xpz)) 

ReCfCf 

P,C 

T 

(c)  Charge  Conjugation:  In  view  of  the  invariance  under  the  combined  operations 
CPT  the  conclusions  regarding  the  invariance  under  C  can  be  drawn  from  those 

1  T.  D.  Lee,  C.  N.  Yang  and  R.  Oehme,  Phya,  Rev.  106,  340  (1957).  Note  that  this  theorem 
holds  only  in  the  absence  of  final  state  interaction. 
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regarding  P  and  T.  Landau,-*^  has  postulated  that  weak  interactions  remain 
invariant  under  GP  or  in  other  words  time  reversal  in  variance  holds.  Invariance 
under  CP  guarantees  that  a  mirror  reflection  of  a  state  of  particles  is 
always  a  possible  state  of  the  corresponding  antiparticles.  For  example,  if 
the  ^  particles  emitted  in  the  decay  of  a  radioactive  nucleus  have  a  particular 
angular  distribution,  then  the  same  angular  distribution  should  exist  for  j5+ 
particles  emitted  in  the  decay  of  the  corresponding  antinucleus.  It  is  interesting 
to  note  that  in  this  case  it  is  not  possible  to  define  a  right-handedness  from  ^ 
decay  alone  unless  we  can  distinguish  matter  from  anti-matter.  The  results  of 
all  experiments  so  far  have  upheld  this  invariance. 


Tiolation  of  P  and  C  invariance  in  weak  interactions 

Having  outlined  the  general  consequences  of  the  three  operations  CPT  we 
shall  now  discuss  in  some  detail  the  experimental  evidence  for  the  violation  of  in- 
variance under  P  and  G.^  The  first  demonstration  of  the  violation  of  parity 
conservation  came  from  the  now  famous  experiment  due  to  Wu  et  al?  On 
the  suggestion  of  Lee  and  Yang,*  they  studied  the  asymmetry  in  the  angular 
distribution  oi  p~  particles  emitted  from  oriented  Co^°  radioactive  nuclei.  The 
reaction  is 

Co«>_>Ni«o+e-  +  r         J  =  5-^J  =  4,  AJ^l{NO),  (29) 

which  obeys  the  Gamow-Teller  selection  rule.  Hence  we  denote  the  nuclear 
matrix  element  by  Mq.j,.  Similarly  the  matrix  element  corresponding  to  the 
Fermi  selection  rule  {AJ  =  0{NO)  and  0  ->  0)  will  be  denoted  by  Mp .  Two  gamma 
rays  are  emitted  when  the  exited  Ni^^  reaches  its  ground  state.  The  anisotropy 
of  gamma  rays  was  used  as  a  measure  of  the  polarization  of  Co^^  nuclei.  The  electrons 
were  found  to  be  emitted  preferentially  in  a  direction  antiparallel  to  nuclear 
orientation,  i.e.  the  angular  distribution  /  (6)  of  the  radial  vector,  p^  (the  momentum 
of  the  electron)  was  asymmetric  about  the  axial  vector  J  (the  direction  of  polari- 
zation of  the  nucleus)  where  d  is  the  angle  between  them.  This  implies 

f{6)^f{7t-d),  (30) 

Or  equivalently  F(d)  the  angular  distribution  on  space  inversion  is  different 
from  f{d)  since 


F{d)  =  f{jT-6). 


(31) 


Lee  and  Yang^  have  shown  that  f{d)  is  of  the  form  f{d)=  1  +ocQosd  with 


1  L.  Landau,  Nucl.  Phys.  3,  127  (1957). 

2  See  Proceedings  of  Rehovoth  Conference  on  Nuclear  Structure,  North  Holland  (1958).  Also 
Rev.  Mod.  Phys.  31  (1958). 

3  C.  S.  Wu,  E.  Ambler,  R.  W.  Hayward,  D.  D.  Hoppes  and  R.  P.  Hudson,  Phys.  Rev. 
105,  1413  (1957). 

^  T.  D.  Lee  and  C.  N.  Yang,  Phys.  Rev.  104,  254  (1956), 
5  T.  D.  Lee  and  G.  N.  Yang,  loc.  cit. 
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where 


and 
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/5  =  Re 


ncp 


..f 


|={|Cj.p+|C^p+  IC'rl^  +  \CU\'}\Mgr 


(32) 


(33) 


1  J/,,.j.  p  represents  the  square  of  the  nuclear  matrix  elements.  Many  authors^ 
have  calculated  «  for  other  transitions.  For  J  ->■  J  +  1 , 


a=-^ 


^  +  1  • 


For 

with 


(CJCr  +  C^*Cr-CK:4 


^F^5] 


2? 


(^5  ^A  +  ^5  *  ^-1  "~  ^F  ^T 


C'y* 


Ct) 


XV, 


2MpMgt 


(34) 
(35) 

(36) 


In  all  these  equations  the  plus  sign  corresponds  to  e"  emission  while  the  minus 
sign  corresponds  to  e"*"  emission.  In  deriving  (33  to  36)  it  has  been  assumed  that 
the  nuclear  matrix  elements  Mp  and  Mg  t  ^^^  real  as  a  consequence  of  the  in  variance 
under  time  reversal.  ^  2 

It  was  found  by  Wu  et  al.  that  ^  ^  —0.7  for  ~  ^  0.65.  For  Ni«o,  -^  =  0.2 

and  the  second  term  has  an  upper  limit  0.23  and  hence  can  be  neglected.  Thus  the 
negative  asymmetry  can  be  explained  by  either  assuming  C^  ~  — Cy  or  C^  ^  +  C^ . 
We  shall  discuss  below  this  important  question  of  coupling  in  ^  decay.  From  the 
table  we  also  see  that  the  observation  of  this  asymmetry  establishes  that  charge 
conjugation  invariance  also  does  not  hold.  From  the  expression  for  ^  given  by  (35) 
we  see,  that  there  exist  interference  terms  between  j^  and  GT  transitions.  While 
the  asymmetry  is  quite  consistent  with  a  pure  O  T  transition  for  Co^°,  it  provides 
no  evidence  for  interference  in  Co^®.  However  for  Sc*^  the  asymmetry  value  can 
be  explained  only  by  full  interference.^ 

It  is  possible  to  prove  parity  violation  if  experiments  are  performed  even  with 
unpolarized  nuclei.  It  is  sufficient  if  we  observe  the  recoil  nuclei  to  be  partially 
polarized  since  in  such  a  case  the  state  of  polarization  of  the  recoil  nuclei  relative 


1  R.  B.  Curtis  and  R.  R.  Lewis,  Phys.  Rev.  107,  1387  (1957);  J.  D.  Jackson,  S.  B.  Trei- 
MAN  and  H.  W.  Wyld  Jr.,  Phys.  Rev.  106,  517  (1957);  M.  Morita  and  R.S.Morita,  Phys. 
Rev.  107,  1316  (1957). 

2  See  C.  S.  Wxj,  Proceedings  of  the  Rehovoth  Conference  on  Nuclear  Structure,  p.  346,  North 
Holland  (1958). 

*  Here  h^  =  2-/  Re  [CJ  C^  +  C'^C^^]  |  M^rl'     and     y  =  [\-  Z^e']i. 
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to  the  direction  of  the  electron  momentum  is  not  invariant  under  space  inversion. 
The  partial  polarization  of  recoiling  nuclei  can  be  detected  through  the  polarization 
of  the  y-rays  emitted  by  these  nuclei  which  depends  on  the  angle  between  the 
direction  of  the  y-ray  and  Pg .  The  angular  distribution  of  polarized  y-rays  as  has 
been  calculated  by  Alder,  Stech  and  Winter/  is  of  the  form 

I{d)  =  l±A~GO&d,  (37) 

c 

where  the  plus  sign  stands  for  right  circular  and  the  minus  for  left  circular  polari- 
zation. 

The  circular  polarization  of  the  y-ray  can  be  detected  by  utilizing  the  dependence 
of  the  cross-section  for  Compton-scattering  of  polarized  electrons  on  the  state  of 
circular  polarization  of  the  incident  y-ray.^  The  conclusions  from  the  experiments 
on  the  polarization  of  y-rays  are  consistent  with  those  based  on  ^  asymmetry. 

Many  experiments  have  been  suggested  to  test  the  invariance  under 
time  reversal.  The  Z-dependent  term  in  the  asymmetry  can  be  used  but  the 
contribution  of  this  term  is  very  small.  From  the  table  we  see  that  tests  for  time 
reversal  invariance  can  be  made  by  looking  for  the  dependence  on  triple  scalar 
products  in  the  /5-decay  distribution  functions.  For  example,  the  electron  neutrino 
correlation  from  oriented  nuclei  J  -{pe  x  q^)  was  suggested  by  Jackson  et  al} 
The  coefficient  of  this  term  is  proportional  to 

lixi(CsCT  +  C'sC^-CyC%-C'yCl)\MF\\MoT\.  '    (38) 

Experiments  on  the  electron-neutrino  correlation  from  oriented  neutrons*  carried 
out  indicate  that  the  coefficient  has  a  value  0.02  ±  0.28  which  can  be  interpreted 
as  confirming  time  reversal  invariance  for  ^  decays.  Another  method  has  been 
suggested  by  Lewis  and  Curtis^  which  depends  on  the  interference  between  T 
and  S  OT  A  and  V  in  the  first-forbidden  p  decays.  This  term  gives  rise  to  a  trans- 
verse polarization  of  the  /9-particles  from  unoriented  nuclei  which  can  be  studied 
through  the  term  {a  •  (pe  x  hy)){Pe  '  ^v)-  ^  third  method  is  to  measure  ^  —  v 
correlations  from  polarized  nuclei,  the  quantity  measured  being  «f^-(|>g  x  k^).  The 
coefficient  depends  on  the  product  \Mp  \Mqj,  |  and  the  effect  of  this  term  will  be 
small  if  the  value  \Mp  \MQrf  \  is  very  nearly  zero.  In  fact,  the  experiment  on  Co^® 
has  indicated  that  the  effect  of  this  term  is  not  present.  But  in  view  of  the  un- 
certainties in  the  interpretation  of  ^  asymmetry  data  from  Co^^  experiment 
nothing  conclusive  can  be  drawn  from  the  absence  of  this  term.  The  experiments 
performed  so  far,  while  lacking  in  precision,  still  give  no  evidence  of  any  specta- 
cular violation  of  time  reversal  invariance.  Thus  we  can  conclude  that  ^  decay 
processes  are  left  invariant  under  CP  sls  postulated  by  Landau  and  others.^ 

1  K.  Alder,  B.  Stech  and  A.  Winter,  Phys.  Rev.  107,  728  (1957). 

2  S.  P.  GuHST  and  L.  A.  Page,  Phys.  Rev.  92,  970  (1953). 

3  J.  D.  Jackson,  S.  B.  Treiman  and  H.  W  .  Wyld,  Phys.  Rev.  106,  517  (1957). 
^  BuRGY  et  al.  Phys.  Rev.  107,  731  (1957). 

'"  R.  B.  Curtis  and  R.  R.  Lewis,  loc.  cit. 
*  L.  Landau,  loc.  cit. 
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Suggested  Experiments  for  testing  nan-invariance  under  T  in  decay  experiments 
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due  to  the  finite 
size  of  the  nucleus. 
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Polarization  of 
electrons  in  the 
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ed //-mesons  de- 
tected by  Mott 
scattering 

A,V 

UCy^C^,  then 
the  effect  will  be 
difficult  to  detect. 

KOTANI 

It  is  easy  to  see  that  the  use  of  such  right-  and  left-handed  particles  and  anti- 
particles  naturally  fits  in  with  the  non- conservation  of  parity  and  charge  con- 
jugation. For  under  P,  we  invert  the  momentum  of  a  neutrino  but  not  its  spin 
direction  and  thus  a  right-handed  neutrino  will  become  a  left-handed  neutrino 
which  is  a  forbidden  state  for  the  neutrino.  Thus  the  theory  violates  in  variance 
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under  P.  Since  G  only  changes  the  particle  into  its  anti-particle  without  changing 
its  spin  or  momentum,  we  see  that  a  right-handed  neutrino  will  become  under  C, 
a  right-handed  anti-neutrino  which  is  again  forbidden.  Hence  the  theory  is  not 
invariant  under  C.  However  the  theory  is  left  invariant  under  CP. 

We  have  already  remarked  that  the  negative  asymmetry  in  the  Co®^  experiment 
is  compatible  with  O^  ^  ±  CI  with  —  for  T,  S  and  +  for  A  and  V.  If  an  anti- 
neutrino  is  emitted  along  with  ^",  it  should  be  left-handed.  Consider  the 
transition  in  Co®^  where  T^^  5 -^  T^  =  4,  AT^^  — 1.^  If  both  the  electrons  and 
the  anti-neutrinos  are  emitted  predominantly  parallel  to  each  other  in  the  — z 
direction  there  is  conservation  of  angular  momentum  since  both  are  left-handed 
particles.  Of  course,  this  negative  asymmetry  can  be  explained  also  with  different 
assumptions  regarding  CI '  s  which  will  be  clear  from  the  following  discussion  on 
the  longitudinal  polarization  of  /5-particles  in  the  two-component  theory.^ 

To  see  how  the  /5-particles  acquire  longitudinal  polarization,  let  us  consider 
the  coupling  between  the  neutrino  and  the  electron  fields  given  by 

feOi(l-y,)w.,.  (38) 

Use  of  commutation  relations  gives  us 

Oir,  =  ±  y,0,  with  +  for  i  =  S,P,T 

—  for  *  =  A,  V. 
Thus  we  obtain 

v^e(i=Fr5)^irr.  (40) 

For  highly  relativistic  electrons  the  mass  can  be  neglected  and  as  for  the  neutrino 
we  have  100  per  cent  right-handed  electrons  for  V  and  A  and  100  per  cent  left- 
handed  electrons  for  S,  T  and  P.  The  positron,  being  the  anti-particle,  will  possess 
the  opposite  handedness.  It  is  important  to  notice  that  the  handedness  of  the  ^- 
particle  depends  both  on  the  handedness  of  the  neutrino  and  the  type  of  ^  coupling. 
However  we  can  measure  experimentally  the  longitudinal  polarization  of  the  ^- 
particles  irrespective  of  such  a  knowledge  and  this  information  can  be  used  as 
a  tool  to  reveal  the  type  of  the  coupling  responsible  for  the  ^  decay.  For  j5- particles 
moving  with  velocity  v,  we  can  easily  show  that  the  helicity  (a  •  p)  will  be  equal 
±  vjc  with  +  for  V  and  A ,  and  —  for  S,  T  and  P. 

A  large  number  of  experiments  have  been  performed  to  measure  the  longi- 
tudinal polarization  of  j5+  particles  emitted  in  the  pure  Gamow-Teller,  pure  Fermi, 
mixed  and  forbidden  transitions.^  Generally  an  electrostatic  deflector  converts  the 
initial  longitudinal  polarization  of  the  particles  into  transverse  polarization  which 
is  then  detected  by  the  left-right  asymmetry  due  to  spin-orbit  forces  in  the  ordinary 


1  T.  D.  Lee  and  C.  N.  Yang,  105,  1671  (1957);  A.  Salam,  Nuovo  Cim.o,  297  (1957);  and 
L.  Landau,  Nuclear  Physics  3,  127  (1957). 

2  T.  D.  Lee  and  C.  N.  Yang,  Elementary  Particles  and  Weak  Interactimis,  47,  Brookhaven 
National  Laboratory  (1957). 

3  See  "Measurements  of  Hehcity"  by  Lee  Grodzins,  Progress  in  Nuclear  Physics,  Vol.  7, 
p.  165,  Pergamon  (1959). 
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Coulomb  scattering.^  Other  methods  devised  to  measure  the  longitudinal  polari- 
zation are  based  on  the  fact  that  the  Moller  scattering  cross-section  depends 
sensitively  on  the  polarization  of  the  scattering  electrons.*  The  high  energy  brems- 
strahlung  of  such  polarized  electrons  inherits  the  polarization  and  the  observation 
of  the  circular  polarization  of  such  y-rays  constitutes  another  method  for  estim- 
ating the  longitudinal  polarization.'  For  positrons,  Page  and  Heinberg*  in 
their  experiments  exploit  the  spin-dependent  features  in  the  two-photon  annihila- 
tion in  flight  of  a  polarized  positron  with  a  polarized  electron  (available  in  magnet- 
ized iron).  With  an  accuracy  of  10  per  cent  all  data  are  consistent  with  a  longi- 
tudinal polarization  of  the  maximum  amount  ±  vjc  for  ^'^  emission.  This  imphes 
the  following  choice  of  coupling  constants  and  handedness  of  the  neutrino 

Cs=  — G's'y  Crp  =  — C't  for  a  right-handed  neutrino 

(41) 
Cy  =  Cy ;      C^  =  C^J^      for  a  left-handed  neutrino . 

A  very  ingenious  experiment  due  to  Goldhaber  et  al}  indicates  that  the  neutrino 
is  left-handed.  In  this  experiment  the  nucleus  with  spin  T  ^  0  (Eu^^^)  decays 
by  a  iC  electron  capture  to  an  excited  nucleus  B  with  T  =^  I  (Sm^^^)  emitting 
a  neutrino.  If  the  neutrino  is  emitted  along  the  z-axis,  then  the  Tz  value  of  B 
is  —  1  or  0  for  positive  helicity  of  the  neutrino  and  +  1  or  0  for  negative  helicity 
of  the  neutrino.  The  recoiling  nucleus  B  reaches  the  ground  state  by  the  emission 
of  a  y-ray.  In  either  case,  the  helicity  of  the  y-ray  emitted  along  the  — z  axis  is 
the  same  as  that  of  the  neutrino.  The  preferential  detection  of  y-rays  emitted 
along  the  recoil  direction  is  accomplished  by  use  of  resonance  fluorescence  detec- 
tion which  is  sensitive  only  to  those  photons  whose  Doppler  shift  due  to  the 
motion  of  the  excited  recoil  nucleus  almost  compensates  for  the  sharing  of  the 
transition  energy  between  the.  y-ray  and  the  final  nucleus.  Thus  the  measurement 
of  the  circular  polarization  of  the  y-rays  which  are  resonantly  scattered  by  the 
nucleus  B  yields  directly  the  helicity  of  the  neutrino.  The  experiment  shows  that 
the  neutrino  is  left-handed. 

From  the  above  discussion  of  the  longitudinal  polarization  and  the  handedness 
of  the  neutrino  we  can  assume  that  the  p  decay  proceeds  through  a  linear  com- 
bination of  V  and  A.  The  experiments  on  ^  decay  from  polarized  neutrons  in- 
dicates that  this  linear  combination  is  V  —  A.  It  is  interesting  to  note  that  this 
conclusion  is  in  agreement  with  {^  —  y)  correlation  experiments*  performed 
recentlv. 


1  H.  Frauenfelder  et  al,  Phys.  Rev.  106,  386  (1957). 

'^  A.  M.  BiNCEF,  Phy9.  Rev.  107,  1434  and  1467  (1957).  See  also  G.  W.  Ford  and  C.  J.  Mul- 
LiN,  Phys.  Rev.  108,  477  (1957). 

»  M.  Goldhaber,  L.  Grodzins  and  A.  W.  Sunyar,  Phys.  Rev.  106,  826(1957);  K.  W. 
McvoY,  Phys.  Rev.  106,  820  (1927);  K.W.  McVoYandF.  J.  Dyson,  Phys.  Rev.  106,1360(1957). 

*  L.  A.  Page  and  M.  Heinberg,  Phys.  Rev.  106,  1220  (1957). 

5  M.  Goldhaber,  M  Grodzins  and  A.  W.  Sunyar,  Phys.  Rev.  109,  1015  (1958). 

*  See  especially  The  Neutrino  by  James  S.  Allen,  ch.  5,  Princeton  (1958). 
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It  was  suggested  by  Lee  and  Yang^  in  their  original  paper  that  the  ji-jn-e  chain 
can  be  used  to  test  parity  non- conservation.  From  the  two-component  neutrino 
theory  it  immediately  follows  that  ^-mesons  obtained  from  the  decay  of  tt^  should 
be  longitudinally  polarized  along  their  direction  of  motion.  Hence  in  the  sub- 
sequent decay  of  the  ju-meson  into  electron  and  v  —  v  pairs,  the  electron  will  have 
an  asymmetric  angular  distribution  if  parity  violation  exists. 

The  jLi  decay  process  can  be  described  by  an  interaction  Hamiltonian  based 
on  the  two  component  theory  of  the  neutrino 


H, 


int 


V,A 


(42) 


where  ^y->  J(l  +  75)^^  corresponds  to  left-handed  neutrino.  The  sum  is  over 
V  and  A  as  the  other  covariants  8,  T  and  P  vanish  identically  due  to  commutation 
properties. 2  Note  that  this  result  is  independent  of  ouf  knowledge  about  nuclear 
decay,  fji^  can  decay  as 

(43  a) 

(43  b) 

(43  c> 


[JL^  ^e 


^  +  V  +  V 

e±  +  2v 


e- 


2  V. 


1  T.  D.  Lee  and  C.  N.Yang,  Phys.  Rev.  104,  254  (1956);  Phys.  Rev.  105,  1671  (1957). 

-  This  result  is  valid  only  in  the  two  component  theory.  If  we  use  four  component  theory, 
by  a  Fierz  transformation,  the  interaction  can  be  made  to  contain  all  the  covariants  instead 
of  just  A  and  V. 

17* 
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MicheP  showed  that  the  energy  spectrum  of  the  decay  electrons  has  the  form 


N(x)  dx  =  4a:a[3(l  —x)+—q{4x  —  3) 


dx,  (44) 


where  x  is  the  electron  energy  in  units  of  its  maximum  value,  g  is  called  Michel's 
parameter  which  has  a  different  value  for  each  of  the  three  schemes  of  decay : 
Q  =  0.75  for  (43a)  while  ^  =  0  for  (43  b)  and  (43  c).  Experiments  indicate 
that  ^  =  .68  ±  0.02.  Thus  (43a)  is  the  only  allowed  decay  scheme  and  this 
has  a  bearing  on  the  principle  of  conservation  of  leptons  to  be  discussed 
below.  The  angular  distribution  of  the  decay  electron  from  polarized  //-meson 
averaged  over  the  energy  distribution  is 


\d^/ 


E 


2_(l±|cose).  (45) 


where  6  is  the  angle  between  p^  and  the  spin  of  the  //-meson  and 

The  upper  sign  corresponds  to  //"  decay  while  the  lower  sign  refers  to  //+  decay. 
This  angular  distribution  was  confirmed  in  an  ingenious  experiment  due  to 
Oarwin  et  al}  A  beam  of  85  MeV  jr+  mesons  was  stopped  in  a  carbon  absorber. 
Some  of  the  //+  from  the  decay  of  jt^  mesons  came  to  rest  in  a  carbon  target  at 
a  short  distance  from  the  absorber.  The  ^  decay  of  //  was  detected  by  electron 
telescopes  which  recorded  electrons  with  energies  >  25  MeV.  It  was  so  arranged 
that  electrons  will  be  recorded  in  an  interval  between  0.75  and  2  //  sec.  after  ju 
had  come  to  rest.  To  record  the  entire  angular  distribution  of  electrons,  the 
//-mesons  were  caused  to  precess  by  means  of  a  small  vertical  magnetic  field. 
The  various  experiments  yield  a  value 

1^1=:  0.87  ±0.13  (46) 

and  show  that  electrons  are  emitted  preferentially  backwards  relative  to  the 
line  of  flight  of  the  //-meson.  The  sign  of  |  cannot  be  known  from  this  experiment 
as  it  depends  on  the  helicity  of  the  //-meson  acquired  when  jt  decayed.  The  helicity 
of  //  is  intimately  related  to  the  principle  of  conservation  of  leptons.  However 
if  we  can  measure  the  helicity  of  the  decay  electrons  from  randomly  oriented 
//-mesons  we  can  find  the  sign  of  i  (cf.  longitudinal  polarization  in  nuclear  p- 

V 

decay).  The  longitudinal  polarization  of  the  decay  electron  is^=±^  —  —  ±$ 

c 

the  negative  sign  referring  to  //"•"  decay.  Experiments  have  shown  that  the  posi- 


1  L.  Michel,  Nature,  Land.  163,  959  (1949);  Proc.  Phys.  Soc.  68  A,  514  (1950);  Phys.  Rev, 
86,  814  (1952). 

2  R.  L.  Gabwin,  L.  M.  Ledermajj  and  L.  M.  Weinrich,  Phys.  Rev.  106,  1415  (1957). 
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trons  from  //+  decay  are  right-handed  indicating  that  |  is  negative. ^  This  implies 
that  f j^^  —fy  SiS  seen  from  (45  a)  which  in  other  words  means  that  the  interaction 
is  F  —  A  which  is  in  remarkable  agreement  with  the  tentative  conclusion  arrived 
at  after  the  discussion  of  nuclear  ^  decay  experiments.  Attempts  have  been  made 
to  improve  the  agreement  between  theoretical  and  experimental  value  of  q 
through  the  inclusion  of  non-local  interaction  in  fj,  decay. ^  The  effects  of  a  radiative 
correction  to  the  electron  energy  spectrum  arising  in  the  /li  decay  have  also  been 
discussed.^ 

In  the  discussion  of  the  ju^  decay  we  have  indicated  the  need  for  a  principle 
of  conservation  of  leptons.*  By  the  mere  use  of  the  two  component  theory  we 
cannot  decide  whether  the  ^  decay  is 

n-->  p  +  e~  +  V 

^p  +  e-  +  V.  (47) 

By  choosing  different  handedness  for  v  and  v  and  also  correspondingly  different 
couplings  we  can  explain  the  present  experimental  data  (Co^°  experiments  etc.). 
However  if  we  allow  both  the  processes  to  have  equal  amplitudes  then  there  will 
be  no  asymmetry  in  the  Co^^  experiment.  We  thus  find  it  necessary  to  invoke 
the  principle  of  conservation  of  leptons.  This  principle  is  analogous  to  the  principle 
of  conservation  of  baryons  and  implies  that  the  number  of  leptons  minus  the 
number  of  anti-leptons  should  be  conserved  in  any  process.  But  this  law  is  ob- 
viously incomplete  without  specifying  which  particles  are  leptons  and  which 
are  anti-leptons.  Let  us  assign  a  leptonic  charge  — 1  to  e+,  /^+  and  v  (anti-leptons) 
and  a  leptonic  charge  +1  to  e~,  v,  /n'.  The  baryons  and  bosons  have  leptonic 
charge  0.  An  immediate  consequence  of  this  is  that  /j,'^  decay  can  proceed  only 
through  (43  a). 

Pion  decay  offers  an  interesting  test  case  for  this  principle .  Consider  jf^  decay. 
This  principle  demands 

:7r+->/^+ +  v.  (48) 

Conservation  of  angular  momentum  requires  that  jn^  be  left-handed  as  the 
neutrinos.^  When  jll  decays,  we  know  that  the  high  energy  positrons  emerge  in 
the  backward  direction  relative  to  the  ^+  meson's  line  of  flight.  Conservation  of 
angular  momentum  requires  that  positrons  should  be  right-handed  as  was  found 
from  experiment.  Thus  with  the  assumption  of  V-A  interaction,  we  find  there  is 


1  G.  CuLLiGAN  et  ah.  Nature,  Lond.  180,  751  (1957). 

2  T.  D.  Lee  and  C.  N.  Yang,  Phys.  Rev.  108,  1611  (1957). 

3  T.  KiNOsmTA  and  A.  Sirlin,  Phys.  Rev.  106,  1110  (1957);  Phys.  Rev.  107,  593  (1957). 
*  E.  KoNOPiNSKi  and  H.  M.  Mahmoud,  Phys.  Rev.  92,  1145  (1953);  see  also  T.D.Lee 

and  C.  N.  Yang,  Phys.  Rev.  105,  1671  (1956). 

^  It  is  interesting  to  note  that  though  leptons  are  left-handed  antUeptons  are  right-handed 
particles  we  can  never  detect  handedness  of  //+  from  7r+  decay  as  the  conservation  of  linear 
momentum  requires  [jl^  to  be  left-handed.  We  recall  that  the  handedness  is  an  invariant 
quantity  under  Lorentz  transformations  only  for  mass  zero  particles,  i.e.  for  particles  moving 
with  the  velocity  of  light. 
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conservation  of  leptonic  charge  in  the  jr  decay.  It  may  be  noted  here  that  if  the 
neutrino  were  assumed  to  be  right-handed,  we  will  have  violation  of  the  principle 
of  conservation  of  leptons  in  the  n  decay. 

The  conservation  of  leptons  has  another  confirmation  from  the  double  and 
inverse  ^  processes.  These  two  processes  can  be  used  to  test  which  of  the  two 
descriptions  of  the  neutrino  namely  the  Majorana^  {v  =  v)  or  the  Dirac  [v  ^  v) 
is  correct.  The  double  ^  decay  as  the  name  indicates  refers  to  a  phenomenon  by 
which  a  nucleus  changes  its  charge  by  two  units  by  ^  decay.  If  the  neutrinos 
are  of  the  Dirac  type,  then  the  processes  are 

n  ->  jo  +  e-  +  r  (49) 

followed  by  ,„^, 

^  w-^/)  +  e-  +  r.  (50) 

Thus,  in  Dirac's  picture  there  should  be  two  neutrinos  and  two  ^  particles. 
However  in  Majorana's  picture  y  =  r  and  we  can  envisage  a  process  involving 
a  virtual  emission  and  absorption  of  a  neutrino.  Thus  the  two  neutrons  change 
into  two  protons  with  the  emission  of  two  electrons,  i.e. 

n  ->p  +  e-  +  r  (51) 

n  +  v^  p  -{■  e-.  (52) 

In  contrast  with  the  continuous  energy  spectrum  of  electrons  emitted  with  Dirac's 
neutrinos,  the  sum  of  the  kinetic  energies  of  the  two  electrons  should  be  a  sharp 
line  spectrum  for  the  Majorana  neutrino.  It  is  clear  that  the  Majorana  view  of 
identical  v  and  v  violates  the  principle  of  conservation  of  leptonic  charge  while 
Dirac^s  view  upholds  it.  As  there  is  very  little  evidence  for  the  presence  of  double  ^ 
decay,  we  can  conclude  that  this  confirms  the  principle  of  the  conservation  of 
leptons. 

The  very  difficult  inverse  ^  reactions  have  also  been  experimentally  studied.^ 
If  V  and  V  are  distinct,  the  following  inverse  ^  processes  are  possible 

V  +  ^^71  +  e^  (53) 
r  +  w->^  +  e".                                                  (54) 

while  if  V  and  v  are  identical,  the  following  are  also  possible 

V  +  ??  ->  w  +  e+  (55) 

V  -{-  n ->  jp  +  e~ .  (56) 

The  cross-section  for  such  processes  can  be  estimated  by  perturbation  theory 
and  they  are  of  extremely  small  value  ('^10'*^  cm^).  If  the  principle  of  conser- 
vation of  leptonic  charge  is  correct,  the  nuclear  reaction  pile  should  be  a  source 
of  anti-neutrinos.  Considering  the  inverse  reaction^ 

i^CP^  +  r  ->  isAF  +  e-  (57) 

1  See  chapter  VII  of  The  Nevirino  by  James  S.  Allen,  loc.  cit. 
-   R.  Davies,  Phya.  Rev.  97,  766  (1957). 
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we  find  that  if  the  Dirac  description  of  neutrino  is  correct,  these  anti-neutrinos 
should  not  disintegrate  CP'.  Experiments  have  been  performed  and  an  upper  limit  of 
9  X  10~^^  cm^  was  obtained  for  the  reaction  cross- section.  Cowan  et  al}  have  mea- 
sured the  cross-section  averaged  over  neutrino  spectrum  and  find  Ogxp  =  4.0  ±1-5 
X  10~^*  cm^,  in  close  agreement  with  the  theoretical  value  6  x  10~**  cm^.  In 
calculating  this  we  have  assumed  that  the  statistical  weight  factor  is  2  correspond- 
ing to  spin  1/2.  However  if  the  "two -component"  neutrino  theory  is  correct,  we  find 
the  interesting  situation  that  all  the  anti-neutrinos  emerging  from  the  reactor 
have  only  one  spin  state  in  which  they  can  be  absorbed.  It  is  to  be  noted  however 
that  the  neutrino  flux  is  independent  of  two  or  four  components.  Thus  the  "spin 
weight"  becomes  1  and  hence  the  cross-section  gets  doubled.  This  means  the 
experimental  value  of  neutrino  absorption  found  out  by  Cowan  et  al.  is  smaller 
by  a  factor  as  the  theoretical  cross-section  is  12  X  10"**  cm^.  At  any  rate,  the 
negative  results  of  Da  vie  s^  can  be  taken  to  confirm  the  principle  of  conservation 
of  leptons. 

Pauh^  has  pointed  out  that  conservation  of  leptons  is  more  complicated  than 
what  the  above  discussion  implies.  For  even  as  in  parity  conserving /5  decays,  we  can 
have  violation  to  any  degree  (instead  of  "yes"  or  "no")  of  the  conservation  of  lepton 
number  in  a  /5  interaction  of  the  form 

^int  =  E  {%  Oi  Wn)  \Cj(y>,  Oi  xp^)  +  Cn  (%  ^i  V^?)]  +  ^  •  ^  •  (58) 

i 

Conservation  of  lepton  number  implies  that  either  Cj  or  Cn  vanishes,  and  maxi- 
mum violation  occurs  when  |C/|  =  |Cjj|.  It  is  interesting  to  compare  the  same 
situation  when  there  is  violation  of  parity.  Allowing  such  violation  in  the  inter- 
action, we  get  in  all  four  constants  Cj,  Q,  Cjj  and  Cji,  two  for  the  parity - 
conserving  part  and  two  for  the  parity  non- conserving  part.  Only  experiments 
can  decide  the  presence  or  absence  of  such  constants.  It  has  been  pointed  out  by 
Case*  that  violation  of  parity  and  occurrence  of  double  p  decay  anticommute 
with  each  other,  which  implies  that  if  there  is  maximum  violation  of  parity  there 
is  very  little  of  double  decay.  According  to  Case,^  mere  use  of  the  two  component 
theory  will  not  cause  violation  of  parity  but  the  specific  form  of  the  weak  inter- 
action in  that  the  ip^  should  occur  only  multiplied  by  (I  — y^)  or  (1  +  yg)  causes 
violation  of  parity.  Physically  this  means  that  only  one  spin  state  of  the  Majorana 
neutrino  is  operative  in  emission  or  absorption.  Thus  if  only  left-handed  neu- 
trinos are  emitted,  only  right-handed  neutrinos  (or  anti-neutrinos  as  we  call  them) 
can  be  absorbed  to  produce  the  same  reaction. 

The  decays  of  strange  particles  offer  possibilities  of  testing  parity  violation. 
Among  the  various  modes  of  decay  of  the  K  particle  the  K^^,  2  decay  is  very  important 

1  C.  L.  CowAN  et  al..  Science,  124,  103  (1956). 

2  R.  Davies,  loc.  cit. 

3  W.  Pauli,  Numo  Cim.  6,  204  (1957);  D.  L.  Pursey,  Nuovo  Cim.  6,  266  (1957). 

*  K.  M.  Case,  Phys.  Rev.  107,  307  (1957) ;  J.  Serpe,  Physica  18,  295  (1952) ;  J.  A.  Mclennen, 
Phys.  Rev.  106,  821  (1957). 
^  K.  M.  Case,  ibid. 
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in  that  it  tests  not  only  violation  of  P  and  C  but  also  tests  the  principle  of  con- 
servation of  leptons.  It  has  been  found  that  the  electrons  from  the  subsequent 
/i-meson  decay  have  the  same  distribution  as  in  :7r  decay,  as  it  should  be. 

We  now  refer  to  discussions  by  Lee  and  Yang^  on  the  study  of  parity  violation 
through /1°  and  2*=^*®  decay.  Consider  A^  produced  through  (n'  —  p)  collisions 

jr- +  p->/lo+ A'o.  (69) 

If  in  the  production  both  S  and  P  waves  are  important,  then  there  arises  the 
possibility  that  A^  will  be  polarized,  the  polarization  direction  being  given  by 
(Pn  *^  X  p^),  i.e.  perpendicular  to  the  plane  of  production,  if  strong  interactions 
conserve  parity.  The  decay  ;r's  in  the  subsequent  decay  of  A^  can  have  an  asym- 
metrical angular  distribution  about  the  direction  \p^^^  x  Pa)  as  in  the  ^  decay 
from  oriented  nuclei.  This  means  that  there  will  be  an  "up-down"  asymmetry 
about  the  plane  of  production.  In  the  centre  of  mass  system  of  A^  and  K^  the 
asymmetry  is  given  by  (assuming  that  A^  has  spin  V2  a-nd  K^  has  spin  zero) 

dw  pT 

d^~l  +  «<«.>g.|.  (60) 

<<y^>  is  the  average  polarization   of  A^,  p^"*^  the  momentum  of  the   outgoing 

pion  and  a  the  asymmetry  parameter.  If  A^  has  spin  ^/g,  parity  violation  allows 

the  emission  of  both  S  and  P  wave  pions  and  if  ^  and  B  denote  the  relative  ampli- 

-2ReAB* 
tudes  for  S  and  P  wave  emission  oc  =   ,  .  ,^ v-^ .  A  large  up  down  asymmetry 

in  the  A^  decay  has  been  found, ^  i.e. 

(X  |<{J'^>|=  0.44  ±  0.11.  (61) 

Similar  experiments  on  Z~  production  and  its  decay  have  shown  that  very  little 
asymmetry  exists.* 

We  shall  now  discuss  the  application  of  the  |ZI/|  =  V2  ^^^  ^o  hyperon 
decays.  For/1®  decay,  the  final  pion  and  nucleon  should  possess  /  =  ^/2.  From  the 
isotopic  wave  function  of  the  combined  n  —  N  system  corresponding  to  /  =  ^/2, 

1  T.  D.  Lee  and  C.  N.  Yang,  Phys.  Rev.  104,  254  (1956);  105,  1671  (1957). 

2  F.  EiSLER  et  al.,  Phys.  Rev.  108,  1353  (1957);  F.  S.  Crawford  et  al.,  Phys.  Rev.  108, 
1102  (1957);  BoLDT  et  al.,  Phys.  Rev.  Lett.  1  356  (1958);  R.  Gatto  (see  Phys.  Rev.  108,  1103 
(1957))  has  shown  using  t\ie  n—p  scattering  phase  shifts  at  37  MeV  that  a  ^  0.18  ±  0.02  for 
charge  conjugation  invariance  to  hold  good. 

Lee  and  Yang  (see  Phys.  Rev.  100,  1755  (1958))  have  pointed  how  the  spin  of  A^  can  be 
determined  from  the  large  asymmetry  of  its  decay  products.  They  have  shown  that  if  <^) 
denotes  the  average  value  of  the  component  of  p„  along  the  normal  to  the  production  plane 
(in  units  of  100  MeV/c) 


^<l>^ 


2; +  2  -^^'  -   2j  +  2 

where  j  is  the  spin  of  yl".  The  large  asymmetry  indicates  j  =  ^/g  and  \oi.\  now  equals  3  <|>. 
It  has  also  been  pointed  out  that  la  |  uniquely  depends  on  the  spin  of  A^  if  we  study  the  decay 
of  completely  polarized  yl",  i.e.  we  set  |P/io|  =  1. 
3  Cool  et  al.,  Phys.  Rev.  114,  912  (1959). 
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we  find  that  the  7t~  +  p  configuration  is  present  with  the  probability  2/3.  Thus  the 
fraction  of  charged  decays  of  A^  as  contrasted  with  its  neutral  decay 
mode  A^^Tt^  +  nis  given  by  ^/g.  This  is  in  very  good  agreement  with  the  reported 
experimental  value  of  0.65.  ±  0.05. 

Z'^  can  decay  into  two  channels,  i.e. 

i:+-^p  + 71^  (62) 

->n  +  7i~ 

while  U-  can  decay  into  n  +  n'  only.  |zl  /]  =  V2  rule  for  Z+  ^  decay  allows  both 
/  ==  1/2  and  I  =  ^/g  for  the  final  pion  and  nucleon  states  in  arbitrary  proportion. 
Gatto  and  others  have  estimated  the  fraction  of  i7''"  decays  into  n  +  jf^  system  using 
the  pion-nucleon  scattering  phase  shifts.  Z"  decays  necessarily  into  the  /  =  ^/^ 
state.  With  the  |/4  /|  =  ^/2  rule,  it  may  be  shown  that  a  relationship  exists  between 
this  transition  and  27"*"  transition. 

Denoting  by  A+,  A,  A^  the  amplitudes  of  decay  into   n7i+,   nji'    and  pn^ 
respectively,  we  have  the  remarkable  relation 

^"-^-^  (63) 

If  time  reversal  invariance  holds  then  we  can  show  that  these  decay  amplitudes 
will  involve  pion-nucleon  scattering  phase  shifts  due  to  the  final  state  interactions 
but  the  amplitude  can  still  be  treated  as  real  numbers  since  the  phase  shifts  will 
be  small  at  these  energies.  Remembering  that  parity  non- conservation  allows  both 
S  and  P  waves  for  each  amplitude,  we  can  treat  with  Gell-Mann^  each  amplitude 
as  a  vector  A  with  A^  and  A^  as  the  x  and  y  components.  The  decay  rates  will  be 

I  4  +  12     I     I   J-|2 

Q^^lA-'l^;         Q-=\A-\^;         i^^l^oj^^  '^  '    ^'^^-A^-^-.     (64) 
However  we  know  from  experiments  that 

^^^V2;  ^■'^^^'^1  (0.96  ±0.06)  (65) 

Xz-  pTT 


Thus  we  have 

and  the  interesting  result 


Q+^Q-—Q^  (66) 

A-'-A-^O         |^+|=|.4-p,  (67) 

i.e.  the  vectors  A'^  and  A~  are  orthogonal  to  each  other  and  are  of  equal  length. 
We  can  express  the  various  amplitudes  as  follows. 

A-^^  =  CGO^p;     A  +  ^  =  csin^ 

A-^  =  —csinP;     J[-^  =  c  cos/5  (68) 


1  R.  Gatto,  Nuovo  Cim.  3,  318  (1956);  B.  D'Espagnat  and  J.  Prentki,  Nuovo  Cim.  3, 
1045  (1956);  M.  KAWAGUcm  and  K.  Nishijima,  Progr.  Theoret.  Phys.  15,  182  (1956). 

2  M.  Gell-Mann  and  A.  H.  Rosenfeld,  loc.  cit.,  p.  206. 
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»^^  40s  _  -c(cos/?+sin/?)  c(co8/?-8in/^) 

^      -  yg  '  ^      ^ j?2 •  ^^^^ 

If  a  is  the  ''up-down"  asymmetry  parameter  discussed  before  in  the  A  decay  the 
a* 8  for  the  three  Z  decays  are 

n7r+ :  a+  =  —  sin  2fi 

nn- :  a"  =  sin  2/5  -.  (70) 

?)7r®:a®  =  cos  2/5 

where  c  and  /5  are  arbitrary  parameters.  Thus  |^/|=  ^U  rule  along  with  CP 
invariance  predicts  a  definite  relationship  between  the  three  asymmetry  parameters. 
Recent  experiments  ^  on  up-down  asymmetry  of  E'^  and  E~  decays  indicate  that 

a+P(i:+)  =  0  (71) 

Of  course  we  must  remember  that  we  can  measure  only  (xP{E)  where  P{E)  is 
the  polarization  of  the  E  particles  when  they  are  produced.  These  results  very 
clearly  indicate  that  P(E'^)  4=  0  and  that  ^+  ^  0.  Thus  /5  is  ?«0  or  nj^-  It  then 
follows  that 

a+^0;    (x^^±\\    (x-^0.  (72) 

The  absence  of  asymmetry  in  27"  decay  may  also  be  due  to  the  fact  that  E~  are 
not  polarized  when  they  are  produced.  At  any  rate  the  predictions  of  the  |zl  / 1  =  ^/g 
rule  seem  to  be  consistent  with  the  experimental  results.  If  we  can  measure  the 
longitudinal  polarization  of  the  protons  emitted  in  E^  and  A^  decays,  then  we 
can  check  whether  there  exists  any  relation  between  the  asymmetries  oc^  and  oca 
which  should  have  opposite  signs  under  the  doublet  approximation  of  baryons. 
Finally  applying  the  |/4/|  =  '^U  rule  for  E~  and  E^  decays,  we  find  that 
T^-/Tff.  =  ^/g  for  the  ratio  of  average  lifetimes  of  these  hyperons.  However  if  the 
transition  is  a  pure  |zl/|  =  ^/g  transition,  this  ratio  is  2.  This  ratio  has  not  been 
verified. 

2.  UNIVERSAL  FERMI  INTERACTION 

Long  before  the  recent  developments  in  weak  interactions  Wheeler  and  Tiomno^ 
had  noticed  the  remarkable  fact  that  the  coupling  constants  in  the  three  processes, 
viz.  /5  decay,  [j,  capture  and  //  decay  are  almost  equal.  Since  all  the  particles 
involved  in  these  processes  are  only  fermions,  this  equality  suggested  the  attrac- 
tive posssibility  of  a  Universal  Fermi  Interaction  (UFI)  which  can  be  represented 
by  the  triangle  below  (usually  referred  to  as  Puppi's^  triangle). 


^  Cool  et  al.,  loc.  cit. 

2  J.  TiOMNO  and  J.  A.  Wheeler,  Rev.  Mod.  Phys.  21,  144,  153  (1949). 

3  G.  Puppi,  Nuovo  Cim.  5,  587  (1948). 


WEAK    INTERACTIONS  267 

The  three  sides  of  the  triangle  C.  E  and  D  refer  respectively  to  ^  decay, 
ju  decay  and  fj,  capture.  By  UFI  it  was  not  only  meant  that  the  coupling  strengths 
should  be  the  same  but  also  the  form  of  the  interaction.  Though  it  was  known 
that  jLt  decay  should  proceed  through  A  and  V  coupling,  the  'old"  electron- 
neutrino  angular  correlation  experiments  gave   conflicting  evidence  regarding 


Fig.  36.  (a)  Puppi  triangle. 

the  couplings  responsible  for  ^  decay.  The  evidence  prior  to  the  recent  experi- 
ments on  the  longitudinal  polarization  of  ^  particles  emitted  in  ^  decay  was 
interpretisd  to  support  the  view  that  only  S  and  T  coupling  induced  ^  decay. 
Under  these  circumstances,  no  definite  progress  could  be  made  regarding  the 
validity  of  UFI.  However  the  experiments  on  the  longitudinal  polarization  of  /5- 
particles  have  shown  that  the  couplings  present  in  ^  decay  are  A  and  V  and  thus 
the  Universal  Fermi  Interaction  was  taken  to  be  a  valid  hypothesis.  We  shall 
show  presently  that  the  UFI  can  be  derived  from  more  general  grounds  consistent 
with  the  new  experimental  facts  on  weak  interactions,  viz.  parity  non- conser- 
vation and  violation  of  invariance  under  charge  conjugation. 

Before  we  take  this  up  for  discussion  let  us  consider  the  possibility  of  the  modi- 
fication of  Puppi's  triangle  to  accommodate  other  weak  processes  like  the  decays 
of  K  particles  and  hyperons.^  Other  processes  involving  fermions  in  Puppi's 
triangle  can  be  obtained  by  reversing  the  reaction  arrow  and  replacing  the  particle 
by  its  antiparticle  when  the  particle  is  transferred  from  one  side  of  the  arrow  to 
the  other.  Thus  we  obtain  from  the  ^  decay  of  a  neutron,  the  K  electron  capture 
by  a  proton  in  a  nucleus 

e"  -j-  p  ->  71  -[-  V  (1) 

and  from  the  ^  decay  of  a  proton  in  a  nucleus,  the  process 

v  +  ^  ->  w  +  e+.  .  (2) 

The  latter  can  be  used  to  detect  the  anti-neutrino.  It  is  clear  that  Puppi's  triangle 
can  never  account  for  the  decays  of  strange  particles.  Dallaporta  and  GrcU-Mann^ 


^  N.  Dallaporta,  Nuovo  Cim.  1, 962  (1953) ;  M.  Gell-Manx,  Proc.  6th  Rochester  Conference, 
Interscience,  New  York  (1956). 
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have  independently  discussed  the  tetrahedron  representation  of  the  four-fermioii 
interactions  shown  below. 

The  tetrahedron  was  obtained  by  adding  to  the  triangle  the  vertex  containing 
pairs  like  pA,  pl^  etc.  into  which  K^  can  decay  and  including  in  the  pn  vertex 
pairs  like  2^^A,AZ~  into  which  n'^  can  decay.  The  two  arrows  represent  virtual 
dissociation.  We  can  easily  verify  that  the  couplings  between  different  vertices 


pn,  I"    A,  etc. 


P^  pr°  etc. 


Fig.  36.  (b)  Tetrahedron  representation  of  four  fermion  interaction  of  positively 

charged  pairs. 


account  for  all  the  various  decay  modes  of  hyperons,  .^-mesons  and  pions.  For 
example,  the  pionic  decay  modes  of  iT-mesons  are  described  by  the  side  F  of  the 
tetrahedron.  In  this  picture  we  can  have  only  the  following  decays  of  neutral 
A'- mesons,  i.e. 

A^  ->  e-  +  V  +  7r+  or  ^-  +  r  +  7i;+  (3) 

A'o  ->  e+  +  r  +  TT-  or  /^+  +  i;  +  71" .  (4) 

The  consequences  of  this  in  the  decays  of  K\  and  K\  have  been  discussed 
earlier  (see  p.  224).  The  tetrahedron  scheme  also  prohibits  the  processes 
//- ^  e^  —  e- -I- e^ ;  yU'^(/^~)^-e'*'(e")  +  r(i')  which  are  known  to  be  vary  rare 
or  absent.  If  we  wish  to  include  a  Fermi  interaction  between  pairs  of 
particles  of  total  strangeness  — 1    such   as  (i7+,  7i)  etc.  a  fifth  vertex  can  be 
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introduced  to  account  for  the  leptonic  decay  modes  of  hyperons  as  has  been  pointed 
out  by  Gell-Mann.^ 

We  have  seen  that  the  "two-component"  neutrino  theory  was  quite  succesful 
in  explaining  the  violation  of  parity  and  charge  conjugation  invariance  in  decays 
involving  neutrinos  and  we  know  that  a  two- component  theory  is  equivalent 
to  multiplying  the  neutrino  spinor  by  (1  ±  y^).  The  violation  of  C  and  P  invariance 
in  the  decay  processes  not  involving  neutrinos  can  also  be  explained  provided 
we  can  convince  ourselves  that  we  should  insert  (1  ±  y^)  before  the  proper  fields. 
However  we  recall  that  the  zero  mass  of  the  neutrino  played  an  important 
role  in  obtaining  the  factor  (1  ±75)  and  arguments  applicable  to  the  neutrino 
cannot  be  used  in  the  case  of  particles  with  mass.  Recently  Marshak  and  Sudar- 
shan^  and  independently  Feynman  and  Gell-Mann^  have  advanced  arguments 
for  attaching  the  factor  (1  ±  ^5)  to  all  the  fermion  fields  in  the  quadrilinear 
Fermi  interaction.  The  arguments  are  apparently  different  but  are  actually  equi- 
valent and  we  here  present  the  theory  of  Feynman  and  Gell-Mann^  in  some  detail. 
When  the  Dirac  equation  for  a  particle  of  mass  m  is  written  as 

{iV  —  m)y)  =  0  (5) 

it  is  of  the  first  degree  with  y)  having  four  components.  However  the  Klein- 
Gordon  equation  describing  a  spin  zero  particle  is  of  the  second  degree  and  relates 
to  a  one  component  wave  function.  Thus  it  should  be  possible  to  obtain  a  "two 
component"  spinor  equation  for  particles  with  spin  ^/g  by  raising  the  degree 
of  the  equation.  To  arrive  at  this  let  us  define  a  ^  with  four  components  by 

W= — (i'^+m)x  (6) 

y  satisfies  the  second  degree  equation 

{.ilfX='m?'x-  (7) 

It  is  important  to  note  that  any  solution  of  the  Dirac  equation  will  be  a  solution 
of  the  second  degree  equation  while  any  solution  of  the  second  degree  equation 
need  not  be  a  solution  of  the  Dirac  equation.  While  yg*  does  not  commute  with 
iA,  it  does  so  with  (iV)^.  Hence  we  can  find  ;f's  which  satisfy  the  second  degree 
equation  as  well  as  the  relation  y^x  =  :tX-  "^^^^  ^^  easily  done,  for 


^  M.  Gell-Mann  and  A.  H.  Rosenfeld,  loc.  cit. 

2  E.  C.  G.  SuDARSHAN  and  R.  E.  Marshak,  Fhys.  Rev.  109,  1860  (1958). 

3  R.  P.  Feynman  and  M.  Gell-MAnn,  Phys.  Rev.  109,  193  (1953);  also  J.  J.  Sakurai, 
Nuovo  Cim.  7,  649  (1958). 

*  The  ^5  used  in  this  section  corresponds  to  the  iy^  used  in  the  earlier  part  of  the  book. 
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If  we  write  V^  ^  ( r  )  where  a  and  b  are  two- component  spinors,  from  the  usual 
representation  of  Dirae  matrices  we  find 


and 


We  thus  notice  that  j^  and  ;f'  are  made  up  of  two  identical  two-component  spinors 
and  from  the  equation  obeyed  by  x  i^  is  clear  that  both  q)  and  0  obey  the 
same  equation.^  Feynman  and  Grcll-Mann'^  argued  that  there  was  no  particular 
reason  why  we  should  couple  the  solutions  of  the  first  order  Dirac  equation 
and  not  those  of  the  second  order.  Taking  (p  to  be  fundamental  it  was  postulated 
that  the  fermion  fields  rp  should  be  replaced  by  0  =-  J(l  +  y^)  y),  the  solution  of 
the  second  order  equation.  This  implies  that  we  replace  ^  by  J  (1  +  y^)y)  and 

V'  ^y  V^ 9 •  Using  the  properties  of  the  Dirac  matrices,  we  find  that 

(1  -  75)0,(1+ 75)  =  0  for  i=S,T,P. 

=  20i(l  +  y^)  for  i^  A,V.  (12) 

Thus  the  ^  deca,y  interaction  takes  the  form 

iO{ipn  y,,(^Wp)  (feyfi^^)  .  (13) 

where  a  =  (I  -{-  y^)  and  (?  is  a  single  coupling  constant  for  both  V  and  A  and  the 
coupling  is  V  —  A.^  From  the  recent  experiments  on  the  asymmetry  of  decay 
electrons  in  the  n-ju-e  decay  chain,  we  have  already  concluded  that  the  above 
interaction  should  be  present  in  /u  decay.  We  have  also  shown  that  this  coupling 
will  predict  the  correct  handedness  of  the  particles  in  processes  like  the  fi  decaj^ 
and  K  capture. 

For  example,  this  theory  predicts  that  the  neutrino  is  always  left-handed, 
which  was  confirmed  by  Goldhaber's*  experiment. 

In  contrast  to  the  two-component  theory  of  the  neutrino,  which  can  explain 
the  C  and  P  violation  only  in  processes  involving  neutrinos,  the  above  theory 
can  explain  the  violation  of  C  and  P  in  any  weak  decay.  Already  in  the  tetra- 
hedron scheme  we  have  indicated  how  all  the  weak  processes  can  be  thought 
of  as  arising  from  four-fermion  interactions.  For  this  we  have  to  envisage  ne\\' 


^  For  electrodynamics  in  the  "two-component"  form,  see  L.  M.  Brown,  Phys.  Rev.  Ill, 
957  (1958).  Also  G.Marx,  Nuclear  Physics  9,  337  (1958). 

2  R.  P.  Feynman  and  M.  Gell-Mann,  loc.  cit. 

3  If  we  were  to  use  right-handed  neutrino,  i.e.  (1  —75)^^  we  would  have  obtained  V  +  A 
coupling  which  does  not  agree  with  experiment. 

^  M.  Goldhaber,  L.  W.  Grodzins  and  A.  W.  Sunyar.  loc.  cit. 
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couplings  like  {A^p)(vfJ,),  {A^p){ve),  {A^p){pn)  as  in  the  tetrahedron  scheme. 
The  parity  violation  in  the  pion  decay  mode  of  K+  can  be  envisaged  as  follows : 

K^  -^  A^  +  p  (strong  virtual  interaction)  (15) 

A9  +  p-^p  +  n^{27i  or  3:^)  (UFI).  (16) 

In  the  second  step  of  the  reaction  violation  of  parity  and  charge  conjugation 
in  variance  takes  place. 

It  is  remarkable  that  the  vector  coupling  constant  obtained  from  the  decay 
of  O^^  (pure  Fermi  j^  decay)  should  give  the  lifetime  of  fj,  decay  correctly.  While 
ju  decay  involves  only  leptons,  the  nucleon  can  emit  and  absorb  virtual  pions. 
It  is  rather  puzzling  that  the  vector  part  of  the  weak  decay  coupling  constant  is 
not  affected  by  the  strong  virtual  pion  interaction.  In  other  words  the  decay 
constant  need  not  be  renormalized.  We  refer  to  a  similar  feature  in  electro- 
dynamics, namely,  that  the  charge  of  a  particle,  i.e.  the  coupling  constant  to 
electromagnetic  field  is  the  same  for  all  the  particles.  This  feature  is  essentially 
a  consequence  of  the  conservation  law  for  electric  charge  and  current.  Feynman 
and  Gell-Mann^  have  shown  how  the  vector  part  of  the  Fermi  coupling  can  be 
made  to  have  no  renormaUzation. 

The  field  theoretic  Lagrangian  for  the  four  Fermion  processes  would  include 
the  six  terms  containing  pairs  selected  from  the  four  bilinears  terms  (fiv)  ,(ev),  {pn) 
and  (pA)  and  their  hermitian  conjugates.  (We  could  of  course  add  to  these  all 
the  equivalent  meson  or  baryon  currents.)  If  to  these  twelve  terms  we  add  the 
four  terms  representing  the  coupUng  of  each  bilinear  term  to  itself  then  the  inter- 
action can  be  written  as  Jj,J^  where 

J^={ve)-\-{flv)  +  {pn)  +  {pA).  (17) 

An  interaction  of  this  type  is  called  a  current-current  interaction,  the  total 
current  comprising  the  sum  of  the  individual  currents.  Such  a  Lagrangian  is 
possible  only  if  it  is  permissible  to  include  the  self  coupling  terms.  The  conse- 
quences of  inclusion  of  such  squared  terms  are : 

(1)  Neutrino-electron  scattering  arising  from,  {v  e)^v  e) .  This  scattering  process 
can  occur  even  in  the  absence  of  the  squared  term  as  a  second  order  effect  in  the 
weak  interaction  which  however  would  be  far  weaker  than  the  interaction  due 
to  direct  coupling. 

(2)  Pair  production  of  neutrinos  by  an  electron  pair  or  y-rays.  The  matrix  ele- 
ment for  the  first  process  vanishes  in  any  local  theory^.  The  second  process  could 
happen  in  stars  and  supernovae.  But  any  process  involving  neutrinos  leads  to 
the  loss  of  neutrino  flux  and  hence  of  energy  and  does  not  lead  to  equilibrium. 


^  R.  P.  Feynman  and  M.  Gell-Mann,  Phys.  Rev.  109,  193  (1958);  S.  S.  Gershtein  and 
J.  B.  Zei/dovich,  J.  Expt.  Theoret.  Phys.  2,  576  (1957).  These  authors  suggested  this  idea 
before  non-conservation  of  parity  was  postulated. 

2  M.  Gell-Mann,  Phys.  Rev.  Lett.  6,  70  (1961). 
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The  cross-sections  for  these  processes  are  just  small  enough  to  leave  the  present 
theories  of  supernovae  unaffected. 

(3)  A  short  range  parity  violating  nuclear  force  arisin  gout  of  {pn)^  (pw).  The 
force  due  to  this  local  coupling  will  be  spread  out  due  to  virtual  strong  interactions. 

What  we  wish  to  show  now  is  the  possibility  of  not  renormalizing  the  total  current, 
i.e.  in  the  total  current  at  least  the  vector  part  of  «/^  is  unaffected  by  the  presence 
of  other  interactions.  As  we  have  noted  above,  the  currents  involving  only  leptons 
will  not  be  affected  by  pion  interactions.  However  considering  the  nucleon  cur- 
rent and  splitting  it  into  vector  and  axial  vector  parts,  we  have 

JN=i(J^  +  Ji)  (18) 

ivhere 

JN  =  9pyf*Wn\      H  =  %y,.y^Wn'  (i9) 

Using  isotopic  spin  formalism  the  vector  part  can  be  written  as 

JN  =  \y>y^.^.W'  (20) 

The  electric  current  is  given  by 

J^i,  =  fy,-~^v-  (21) 

The  isovector  part  of  the  electric  current  is 

JV  =  iVV,r,ri>.  (22) 

Comparing  (20)  and  (22)  we  see  that  they  can  be  obtained  by  different  projections 
of  the  same  isotopic  vector.  Thus  we  can  expect  that  the  matrix  elements  of 
vector  part  of  the  electromagnetic  and  weak  interactions  should  be  related; 
especially  since  the  strong  interactions  are  isotopically  invariant,  the  vector  parts 
of  these  two  interactions  should  have  similar  properties.^  The  term  |y)y^(l  +  r^)^ 
is  not  conserved  unless  we  add  the  electric  current  of  pions  and  strange  particle. 
In  analogy  we  can  argue  now  that  J^  will  not  be  conserved  unless  we  add  the 
pion  and  strange  particle  contributions  to  it.  Thus  the  conserved  Jj  is 

JN  =  Wyf.Q^W  +  i  [9^*  Q^^t.^-  i^n  (P*)  Q+<P]  +  "'  (23) 

If  we  substitute  this  current  for  the  vector  part  in  (17)  the  vector  coupling  con- 
stant will  not  be  affected.  The  newly  added  terms  predict  the  transition 
n^-^7if^  +  V  +  e^,  a  process  of  low  probability.  These  terms  will  predict  the 
processes 

K^->K+  +  e-  +  v,  Ko->K-  +  e+  +  v,  I-^Z^+e-  +  v  and  S-^S^+e-  +  v.  (24) 


^  It  is  interesting  to  note  that  the  conservation  of  vector  part  of  the  weak  interaction  current 
is  automatic  in  Sakata's  model  which  considers  only  p,  n,  and  A^  as  elementary  and  all  the 
other  particles  as  composite.  See  "Strange  Particle  Decays"  by  L.  B.  Okun,  Ann.  Rev.  Nud. 
Set.  9,  61  (1959). 
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While  the  first  two  processes  depend  only  on  the  vector  coupling  constants,  the 
remaining  two  processes  can  also  proceed  through  axial  vector  current,  the  reason 
being  that  it  is  impossible  to  construct  axial  vector  currents  bilinear  in  meson 
fields  of  the  same  parity  and  zero  spin. 

Pursuing  the  analogy  of  the  vector  part  of  the  weak  interaction  with  the  electro- 
magnetic interaction  Gell-Mann^  noticed  an  important  correction  to  the  nuclear 
matrix  element  in  ^  decay  which  he  called  "weak  magnetism"  corresponding 
to  the  magnetic  form  factor  occurring  in  the  nuclear  matrix  element  for  electro- 
magnetic interaction.  The  magnitude  of  this  "weak  magnetism"  can  be  precisely 
estimated  if  we  accept  the  conserved  vector  current  hypothesis.  Then  the 
weak,  magnetism  form  factor  is  equal  to  the  moment  form  factor  of  Hofstadter 
et  al}  The  effect  of  weak  magnetism  can  be  best  estimated  if  we  have  y  and  P 
decay  from  two  initial  nuclear  states  both  of  them  belonging  to  the  same  iso- 
multiplet.  Such  an  experiment  has  been  proposed  by  Gell-Mann  utilizing  the  P", 
y  and  j5+  decay  of  B^^,  C^^*  and  N^^  respectively  which  form  an  isomultiplet. 
The  effect  of  weak  magnetism  is  considerable  in  ^~  capture.^ 

Regarding  the  axial  vector  part  of  Jjy^  it  is  not  clear  whether  there  should  be 
any  renormalization  or  not.  The  present  evidence  from  the  decay  of  free 
neutrons  indicates  that  the  ratio  of  Gamow-Teller  to  Fermi  coupling  constants 
is  1.19  :  0.4.^  It  is  hoped  that  departure  from  unity  is  due  to  renormalization. 
Conservation  of  axial  vector  current  is  not  desirable  as  it  forbids  ji-fji  and  7t-e 
decays.  Further  in  contrast  to  the  case  of  vector  current  such  a  hypothesis  does 
not  lead  to  equality  of  axial  vector  coupling  constants,  i.e.  absence  of  renormali- 
zation effects.^  It  has  been  noted  by  Goldberger  and  Treiman^  that  this  hypo- 
thesis gives  a  larger  effective  pseudoscalar  interaction  in  ^  decay.  The  situation 
concerning  strangeness  non- conserving  current  like  (A^'p)  is  more  complicated. 
As  for  the  strangeness- conserving  current  the  axial  part  of  this  current  should 
not  be  conserved  as  it  will  forbid  K^^.  decay  (we  have  assumed  that  K^  is  pseudo- 
scalar).  Since  A^  and  f  belong  to  different  isomultiplets  conservation  of  the  vector 
part  of  this  current  will  not  lead  to  the  absence  of  renormalization  effects  for 
coupling  constants  unless  "global  symmetry"  is  assumed. 

When  the  Universal  Fermi  Interaction  with  V-A  coupling  was  proposed  it 
was  faced  with  what  seemed  an  inexplicable  fact  that  the  experimental  ratio  of 

jf~~  ^  p~  —1—  y 

the  two  decay  modes  of  pion is  <  10"^  while  the  UFI  predicted 

•^  ^         n  -^ju    +  V 

1  M.  Gell-Mann,  Phys.  Rev.  Ill,  362  (1958). 

2  R.  Hofstadter  et  ah,  Phys.  Rev.  Lett.  6,  290,  293  (1960). 

^  Unlike  in  j3  decay,  the  momentum  transfer  is  large  in  this  case  and  hence  the  effect  of  weak 
magnetism  will  be  large.  See  S.  Weinberg,  Phys.  Rev.  Lett.  2.  223  (1959). 

4  M.  T.  BuRGY  et  at.,  Phys.  Rev.  110,  1214  (1958). 

^  For  a  more  complete  discussion  of  this  question,  see  Problems  in  Theory  of  Weak  Inter- 
actions R.  Gatto,  Fortschritte  der  Physik,  X,  147—181  (1959);  see  also  R.J.  Blin-Stoyle, 
Nuovo  Cim.  10,  132  (1958).  See  also  A.  P.  Balachandran,  Nuovo  Cim.  In  press. 

6  M.  L.  Goldberger  and  S.  B.  Treiman,  Phys.  Rev.  110,  1478  (1958). 

epcr     18 
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a  ratio  — 10  *.^  However  recent  experiments*  have  borne  out  the  ratio  ^^10  '. 
The  experiment  is  based  on  the  fact  that  the  electron  emitted  in  the  direct 
Ti^e  +  V  decay  has  a  sharp  energy  .^72  MeV  while  the  electrons  emitted  in 
7i-fi-e  decay  chain  has  a  spectrum  and  the  maximum  energy  is  only  52  MeV. 
According  to  UFI  the  pion  undergoes  a  strong  virtual  interaction  and  dis- 
sociates into  any  baryon-anti-baryon  pair  consistent  with  charge  independence 
and  strangeness  conservation  such  as  {fu),  {Z'^,A)  etc.  the  annihilation  of  the 
anti-baryon-baryon  pair  resulting  in  the  production  oi  fjt,  y  or  e,  v  through  UFI. 


Fig.  37.  n-fj,  decay. 

The  black  box  denotes  all  possible  strong  interactions.  The  pseudoscalar  nature 
of  the  pion  demands  that  these  currents  in  UFI  can  only  be  axial  vector.  Treat- 
ing the  Yukawa  process  in  the  lowest  order,  Finkelstein  and  Ruderman®  showed 
that  this  ratio  is  — 1.3  x  10^*.  It  is  important  to  notice  that  in  the  n  —  e  decay 
mode  both  the  electron  and  neutrino  are  relativistic  while  the  /^-meson  is  not  so. 
Hence  the  difference  in  the  processes.  A  similar  difficulty  existed  with  the  radiative 
decay  of  the  pion,  i.e. 

7i->e  +  V  +  y  and  ti ->  (jl  +  v  +  y .  (25) 

Treiman  and  Wyld*  have  discussed  the  process  under  various  couplings  and  find 
that  it  has  a  larger  cross-section  for  tensor  coupling.  Expressing  the  ratio 
of  the  radiative  to  the  ordinary  decay  mode  by  q' ,  Cassels^  has  found  that  the 
ordinary  pion  decay  goes  through  the  axial  vector  coupling.  The  ratio  is  much 
higher  for  tensor  coupling.  In  the  decay  of  the  pion  six  possible  baryon-anti-baryon 
pairs  can  occur  in  the  intermediate  states  and  may  have  interference  effects 
which  cannot  be  estimated  with  present  computational  techniques.  Attempts  have 
recently  been  made  to  apply  the  theory  of  dispersion  relations  to  n-  and  ti^ 
decay.  Retaining  nucleon  pair  alone  in  the  intermediate  state  Goldberger  and 


^  See  for  a  thorough  discussion  on  this  difficulty  "  Forbidden  n  —  ^  decay  "  by  R.  P.  Feyn- 
MAN.  International  Conference  on  High  Energy  Physics  at  CERN  (1958)  p.  216,  Ed.  B.  Feretti. 

2  Fazzini  et  al.,  Phys.  Rev.  Lett.  1,  247  (1958);  Impeduglia  et  al.,  Phys.  Rev.  Lett.  1,  249 
(1958);  H.  L.  Anderson  et  al.,  Phys.  Rev.  Lett.  2,  53  (1959). 

3  M.  RuDERMAN  and  R.  Finkelstein,  Phys.  Rev.  76,  1458  (1949). 

4  S.  B.  Treiman  and  H.  W.  Wyld  Jr.,  Phys.  Rev.  106,  1320  (1957). 

5  J.  M.  Cassels  etal.,  Proc.  Phys.  Soc.  A  70,  729  (1957). 


WEAK    INTERACTIONS  275 

Treiman^  have  obtained  surprisingly  good  agreement  with  experimental  decay 
rate  for  tt  —  /^  decay. 

The  theory  of  weak  interactions  developed  above  is  based  on  the  cm-rent- 
current  interaction,  i.e.  JjyJ^r.  The  current  Jj^  consists  of  these  terms 

(1)  a  leptonic  current  ji  consisting  of  terms  {/li~v),  {e-v) 

(2)  a  strangeness- conserving  current  jg^  involving  terms  like  (np),  (A A) 

(3)  a  strangeness  non- conserving  current  ^^^^  containing  terms  like  {pA^),  (pl^^), 

/u  decay  is  a  pure  leptonic  decay  and  arises  from  the  coupling  of  y^  with  itself,  j}  ji  will 
also  describe  electron-neutrino  scattering,  pi  capture,  n  decay  and  P  decays  are 
described  by  the  various  terms  arising  from  jl^  ji .  Leptonic  decays  of  strange 
particles  are  due  to  the  coupling  jlndi  while  the  non-leptonic  decays  of  strange 
particles  arise  from  the  interaction  term  jjuchc-  ^^  important  property  of  all 
the  terms  in  the  three  currents  are  that  they  are  charged  and  no  term  is  com- 
posed of  oppositely  charged  particles.  The  absence  of  neutral  currents  has  not 
hitherto  been  explained.^  The  presence  of  neutral  currents  will  induce  processes 
like  /^+->e+  +  e~  +  e+  which  seem  to  be  rare  or  absent. 

An  interesting  rule  has  been  proposed  by  Feynman  and  Gell-Mann^  for  the 

AQ 

terms  in  ;/s„g  which  states  that  the  terms  should  obey  — r^  =  +  1.  This  rule  forbids 

—  An  1/1 

terms  like  (27+,  n)  in  which         '  ==  — 1  and  {pS^)  which  has  —r~=  +  V2  ^-nd 

AS  AS 

also  involves  AS  =  +2.  The  presence  of  such  terms  can  be  tested  as  follows. 
The  coupling  of  (27+,  n)  with  {e~,v)  will  lead  to  an  appreciable  Z'^  ->  n  +  e^  -\-  v 
which  has  not  been  observed  so  far.*  The  coupling  of  (2"+,  n)  with  (pA^)  will 
lead  to  weak  interactions  having  AS  =  ±2.  The  terms  like  (pS^)  will  induce  the 
decay  S^  ~>  p  —  7t~  {AS=  -^2)  instead  of  the  observed  two-step  decay.  As 
Okun  and  Pontecorvo^'^  have  pointed  out  the  presence  of  such  weak  interactions 
will  lead  to  a  more  frequent  oscillation  (.^10"^^  sec)  between  K^  and  K^  instead 
of  the  normal  oscillation  (-^  10^^^  sec)  and  the  transition  K^  —  K^  will  be  pro- 
portional to  G  instead  of  G^  as  it  would  be  in  the  absence  of  these  interactions. 
The  present  experimental  evidence  does  not  favour  rapid  oscillation  of  K^  and  K^. 


1  M.  L.  GoLDBERGER  and  S.  B.  Treiman,  Phys.  Rev.  110,  1216,  1478  (1958). 

2  See  G.  Gamba  and  R.  E.  Marshak,  Proc.  Nat.  Acad.  Sci.,  45,  881  (1959).  In  this  paper, 
the  authors  suggest  an  interesting  symmetry  between  A  —  n  —  p  and  ju  —  e  —  v. 

^  R.  P.  Feynman  and  M.  Gell-Mann,  loc.  cit. 

4  J.  Leitner  et  al.,  Phys.  Rev.  Lett.  3,  186  (1959). 

^  L.  B.  Okun'  and  B.  Pontecorvo,  J.  Expt.  Theoret.  Phys.  5,  1297  (1957). 

^  It  is  usually  argued  that  since  the  mass  difference  between  Ki  and  ^^2  is  small,  it  can  arise 
only  due  to  a  second  order  weak  interaction  so  that  first  order  AS  =  2  non-leptonic  decays 
should  be  forbidden.  It  can  however  be  shown  (see  S.  L.  Glashow,  Phys.  Rev.  Lett.  6,  196 
(1961))  that  even  iiA8  =  2  decays  are  allowed  only  that  part  of  the  transition  matrix  element 
which  is  even  under  charge  conjugation  contributes  to  dm,  so  that  it  is  only  required  that 
part  of  the  interaction  Lagrangian  even  under  C  be  no  more  than  10"^  times  as  strong  as 
the  usual  AS  =  I  interaction  ^m  is  the  K^K^^  mass  difference. 

18* 
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Earlier  we  have  discussed  the  |zl/ 1  =  1/2  rule  for  strange  particle  decays. 
Sudarshan  et  al}  have  attempted  to  explain  this  rule  by  postulating  that 
the  jfne  transforms  like  an  isospinor.  This  transformation  property  will  give  for 
the  leptonic  decay  of  strange  particles  the  rule  |/^  / 1  =  ^/2  for  the  strangeness 
non-conserving  part.  A  very  important  prediction  of  these  rules  is  that  the  energy 
spectra  of  secondary  particles,  angular  distribution  and  polarization  in  iC^g 
decays  of  K-*-(K-)  and  K^{K^)  mesons  should  be  similar  and  the  total  decay  rates 
should  be  in  the  ratio  1:2.  We  also  have  the  result  that  three-body  leptonic  decay 
modes  of  XJ  and  K^  should  have  equal  probabilities.  This  implies  for  the  decay 
rates  (o  the  relations 

co{Kl->^7i-  -h  I*  +  v)  =  o}{Kl-^7i-  -^  r+  v)  =  co{K^-^7iP  +  l-'+v)      (26) 

where  I  refers  to  /n  or  e.  Estimates  of  the  lifetime  of  K^  from  the  knowledge  of 
X^3  and  Kg^  and  K^  decays  are  in  general  agreement  with  present  experimental 

AQ 
value.  However  the  rule  — r^  =  + 1  does  not  yield  the  2  :  1  ratio  of  charged  to 

neutral  decays  of  A^  which  is  predicted  by  |^  / 1  =  ^/2  rule  when  applied  to  the  whole 

process.  Further  this  rule  in  the  lowest  approximation  prohibits  Z+  decays  while 

allowing  2""  decay.  Recent  experimental  evidence  seems  to  indicate  that  the 

probabilities  of  the  three  modes  are  equal  and  the  decays  of  Z"  are  well  described  by 

the  |zl  / 1  =  1/2  rule.^  Since  A^  decay  seems  to  obey  the  rule  |zl  / 1  =  1/2  in  the 

AQ 
lowest  approximation  it  is  difficult  to  reconcile      .      =  1  with  the  non-leptonic 

27  decays.  It  is  interesting  to  note  that  to  obtain  |Zl  /  |  =  ^/^  for  A^  decay  we  are 

forced  to  include  neutral  currents  like  nn.  The  observation  of  i7+  ->n  +  e'^  (ju'^)  +  v 

AQ 
idll  invalidate  the  rule  —7-q-  =  + 1 .  Only  increasing  experimental  evidence  on 

polarization  properties  in  various  non-leptonic  decay  of  strange  particles  can 
throw  light  on  these  conflicting  issues. 

Another  problem  which  remains  to  be  solved  concerns  the  ^  decay  of  hjrperons,  i.e. 


A^ 


p+  e-  +  V 

p  +  ju-  +  V  (27) 

n  +  e-  +  V 

n  +  ju-  -I-  V.  (28) 

Assuming  renormaUzation  effects  to  be  small  we  expect  that  the  fractions  of 
the  p  decay  oi  A  and  Z~  should  be  1.6  and  5.6  per  cent  respectively.  However 
the  present  experimental  evidence  seems  to  indicate  these  decays  are  much  less 


^  E.  C.  G.  Sudarshan  et  al.,  Phys.  Rev.  112,  665  (1958).  This  paper  also  contains  references 
to  their  earlier  work  on  the  subject. 
2  Cool  et  al.,  loc.  cit. 
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frequent  and  if  we  take  this  seriously  we  are  forced  to  conclude  that  the  jg^^^  is 
weaker  than  the  jgc  by  a  factor  of  13.^ 

Finally  we  shall  take  up  the  problem  of  why  all  the  terms  in  weak  interactions 
should  be  charged.  In  analogy  with  the  electromagnetic  interaction  we  can 
assume  that  weak  interactions  are  not  primary  but  are  caused  by  an  exchange 
of  a  charged  vector  boson  B  between  the  two  currents. ^  This  will  at  once  explain 
the  absence  of  neutral  currents.  Following  this  argument  we  conclude  that  the 
mass  of  the  boson  must  be  greater  than  the  mass  of  iT-meson  in  order  to  pre- 
vent the  latter  from  undergoing  decay  through  the  boson,  i.e.  m^  >  m^.  The 
lifetime  of  B  is  estimated  to  be  10~^^  sec  and  the  direct  observation  of  m^  is 
rather  difficult.^  However  if  B  exists  it  causes  the  decay  /^  ->►  e  +  y.*  It  is  found 
that  the  estimated  branching  ratio  is  greater  than  the  experimental  value 
'-^2  X  10~^.  Thus  we  can  conclude  that  B  does  not  exist. ^ 


3.    SOME   RECENT  WORK  ON  THE  THEORY  OF 
WEAK  INTERACTIONS 

In  our  previous  discussion,  we  have  referred  to  a  few  problems  which  are 
yet  to  be  understood  theoretically.  The  following  three  problems  have  been 
explained  to  some  extent  by  recent  work: 

(a)  The  life  time  of  n-^  decay  and  the  surprising  validity  of  the  Goldberger- 
Treiman  formula ;  ^ 

(b)  The  up-down  asymmetries  of  A  and  Z  hjrperon  decays  which  suggest 
a  need  for  an  extension  of  y^  invariance  in  a  manner  such  that  left-handed  fields 
occur  in  A  decay  while  both  right-  and  left-handed  fields  occur  in  E  decays ; '" 

(c)  The  1^  /  I  =  ^/2  rule  and  the  need  for  the  inclusion  of  neutral  currents. 
We  shall  summarize  here  Giirsey's  attempts  in  devising  a  theory  to  explain 

such  problems®  and  make  references  to  similar  works  of  others.   Giirsey  has 


1  R.  H.  Dalitz,  Rev.  Mod.  Phys.  31,  823  (1959).  This  volume  also  contains  interesting 
papers  on  weak  interactions  presented  at  Gatlinburg  Conference,  1958. 

2  R.  P.  Feynman  and  M.  Gell-Mann,  loc.  cit. 

^  If  the  intermediate  vector  boson  B  has  a  mass  roughly  equal  to  that  of  K,  the  cross- 
section  for  the  process 

v  +  e~->B->v+/bi~ 

will  show  a  resonance  at  2.5  x  10^^  eV.  The  cross-section  at  resonance  is  independent  of  the 
strength  of  the  interaction  and  the  width  is  related  to  the  lifetime  of  B  (see  T.  Kinoshita, 
Phys.  Rev.  Lett.  4,  378  (I960)). 

*  G.  Feinberg,  Phys.  Rev.  110,  1482  (1958). 

^  H.  F.  Davis  et  ah,  Phys.  Rev.  Lett.  2,  211  (1959);  D.  Berlyet  et  al.,  Phys.  Rev.  Lett. 
2,  357  (1959). 

«  M.  L.  GoLDBERGER  and  S.  B.  Treiman,  Phys.  Rev.  110,  1478  (1958). 

'  S.  A.  Bludman,  "  Phenomenological  analysis  of  hyperon  decay  rates  and  asymmetries", 
Gatlinburg  Conference  on  Weak  Interactions  (October,  1958);  Bull.  Amer.  Phys.  Soc.  9,  83 
(1959);  also  Phys  Rev.  115,  468  (1959). 

8  F.  GiJRSEY,  Nuovo  Cim.  16,  230  (1960);  also  "On  the  Structure  and  Parity  of  Weak 
Interaction  Currents",  preprint.  Institute  for  Advanced  Study  (1960). 
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considered  the  possibility  of  a  four-dimensional  extension  G^  of  the  isotopic 
spin  group  G^ ,  such  that  G^  together  with  the  hypercharge  gauge  group  contains 
charge  independence,  strangeness  selection  rules  and  the  empirical  symmetries 
of  weak  interactions  as  various  subgroups.  The  model  treated  by  him  should 
be  regarded  as  an  illustration  of  the  possibility  that  the  symmetries  of  weak 
interactions  may  be  essentially  contained  in  those  of  strong  interactions  and 
not  as  a  definite  proposal  for  a  theory  of  elementary  particles. 

The  strong  interaction  Lagrangian  is  taken  to  consist  of  two  parts  L  and  L' 
where  L  is  rigorously  invariant  under  6^4  and  U  is  invariant  only  under  G^  and 
acts  as  a  perturbation  owL.  G^  in  variance  implies  a  doublet  approximation  {DA) 
for  baryons  and  U  acts  as  a  doublet  perturbation  (DP).  If  G^\&  approximately 
valid  for  strong  interactions,  one  may  hope  that  renormalization  effects  will  be 
small  for  any  interaction  invariant  under  a  subgroup  of  Gj^  and  especially  for  weak 
interactions.  To  define  G^  we  first  take  a  strongly  interacting  system  like  pions 
and  nucleons.  Consider  the  following  transformation  of  the  n  —  N  system : 

G'^:\p-^Q^'^-'-'"\p,  (1) 

This  does  not  leave  the  nucleon  equation  invariant  unless  m  =^  0.  The  trans- 
formed equation  reads: 

iy^d^y)  =  me^^''^'''*y)  (2) 

with  ix}-=  a  and  d^,  =  .  FoUomng  Nishijima^,  let  us  identify  a  with  the  pion 

«  =  /^  =  |f  (3) 

where  /  and  g  are  the  pseudo vector  and  pseudoscalar  coupling  constants.  We 
then  obtain 

g^  w^ 
iy^^d^y)  =  me^^'y^^'^'y)  ^  my)  -\-  igy^r  -  (ftp V^  +  •  •  •  (4) 

(2)  may  be  wTitten  as  •      o  ^ 

iy^d^y)L  =  m(pxpR 

where-  _ 

and  -       — 

00  =.00=  I.  (7) 

Now  we  may  define  two  three-dimensional  commuting  rotation  groups  G^^ 
and  G^^  operating  on  left-  and  right-handed  nucleons  independently.  Adjoining 
G^^^  to  Gf\  we  obtain  the  four-dimensional  rotation  group  G^. 

G^^^:y)R-^e^^*\pR,         0-^e^^^0  Wl^Wl  (9) 

1  K.  NismjiMA,  Niurvo  Cim.  11,  910  (1959). 

2  The  ^5  here  is  equal  to  iy^  of  chapter  I. 
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with 


0- 


exp  \i 


i  +  rs  1 


75 


TV 


giT-y  0  Q-ir-n 


(10) 


The  isotopic  spin  group  G^  obtained  by  taking  ^^  v  =^  O)  commutes  with  CP, 
C  and  P  while  G^^  and  Gf^  commute  only  with  CP  and  not  with  C  and  P. 

The  pion  kinetic  energy  term  in  the  strong  Lagrangian  can  be  made  invariant 
under  6/4  by  the  substitution 

1 


{^,<fy 


nTr[{d^0){d^0)]K 


(11) 


The  first  term  in  the  expansion  in  powers  of  /  is  the  usual  pion  kinetic  energy 
while  the  additional  terms  represent  pion-pion  interactions.  The  pion  bare  mass 
term  however  cannot  be  made  invariant  under  G^  and  will  consequently  act 
as  a  DP. 

We  may  now  assume  the  following  transformation  properties  of  the  other  fields. 
For  the  S  field,  S^  is  assumed  to  transform  like  ipji  and  Sji  like  ipj^  so  that  its 
interaction  with  (p  will  be 

iy^^^^Ws  =  m^exp{— 2i/75T  •  g^ips-  (12) 

We  also  introduce  the  following  matrices  for  the  A  and  E  hyperons : 


Al+IT'  Sj/,     Yr  =  Ar+  ir  .  Zj 


They  are  assumed  to  transform  under  G^  as 


->-e' 


T-^y^e-vT/i 


e*^"7^e 


so  that  we  have  the  following  equations  of  motion  invariant  under  G^ : 


'r^d^,YL  =  mY0YR 


The  matrix 
transforms  as 


iyA. 

Yn- 

=  my' 

0Yj^ 

Y'l- 

-A^ 

—  ^T 

•^L 

n- 

^e*^" 

T^e-^^-^ 

(13) 
(14) 

(15) 
(16) 

(17) 


while  Y'ji  =-  Aji  ^  ir  -  Hr  transforms  as  F^.  The  iT- meson  doublet  is  assumed 
to  transform  like  ipji . 

The  group  G^  may  be  enlarged  to  {G^  x  H)  where  H  is  the  hypercharge  gauge 
group  defined  by 

H-.tp-^e'^'y),         S-^e-'^'S,          K-^e'^'K  (18) 

(^4  X  H)  thus  admits  H,  G^,  G^^^  and  Gf^  as  subgroups.  When  the  DP  is  switched 
on,  (t4  in  variance  is  lost  and  the  strong  interactions  are  invariant  only  under  H 
and  (rg.  Now  since  charge  must  be  conserved  in  all  cases,  we  may  enlarge  6r^^^ 
and  G^^  to  4-parameter  unitary  groups  which  also  describe  charge  conservation. 
Any  interaction  invariant  under  G^^  or  G^^  alone  necessarily  violates  parity. 
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Thus  we  can  have  two  classes  of  weak  interactions,  Class  I  invariant  under  O^^ 
and  Class  II  under  0^^\  First  we  consider  the  unitary  subgi'oup  U^^^  of  {O^  x  H) 
obtained  by  taking  Vj  =  y^  ^  0,  Vj  =  u.  Then  under  l!7<i)  we  have 

Yl  —  e*^** Yj^ e-*''.*',         Yr  -^ e*'.** Y^ e'^^" 
"0_^e»'  'M^e'^*",  jK'^e*"e*^«ir.  (19) 

Similarly  we  have  another  unitary  group  associated  with  O^^,  namely, 

5^_^e-»"e»»"£'^,  S"^  — e-^<i-'^3)t*5'^ 

Yl  -^  e*^."  7^  e-»^',         Y^  -^  e*»-  7^  e"*^' 
0-^e'^*'*0e-'^  \  i:-^e*"e»»-'iC.  (20) 

To  include  leptons,  we  may  define  the  transformation  properties  of  the  lepton 

current 

^A.  =  (e  +  ft)y,v  =  i(i  +  fi)y^{l  +  n)^  (21) 

as 

C7(i):i^^e2^"Z^.  (22) 

Now  we  define  the  matrix 


0  = 
with 


-K-     K^ 


=  K\  +  iTK  (23) 


t7(i):0->e*^»"!9e-^^»".  (24) 


If  the  decay  of  the  A'-meson  into  pions  is  described  by  Class  I  interactions,  we 
have  the  interaction  invariant  under  U^^^  as 

L^  =  aanTr[{0d^0){d^6  +  Kl(0d^0)]\+h-c,  (25) 

where  a  is  a  dimensionless  coupling  constant  and  Q  the  Fermi  constant.  Expand- 
ing 0,  we  have 

L^^AaQ  n  iKl(d^  q>f  +  if"  (8^  K)  (S,  <p) 

~{d,K){<p.d^<p)  +  jf{e,K){f^d,<p-2<pe^<p^)].  (26) 

The  first  and  fourth  terms  of  (26)  lead  respectively  to  the  two  pion  decay 
of  ill  and  the  three  pion  decay  of  ^2  and  K^.  The  M^l  =  ^/2  rule  holds  since  Lj^ 
transforms  like  an  isotopic  spin  or. 

The  pionic  decays  of  Z  and  A  may  also  be  described  by  requiring  invariance 
under  U^^\  We  have 

\  Pl   ^lI 
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where  a'  is  a  dimensionless  coupling  constant.  Expanding  0  in  powers  of  /, 
we  have 

+  (^r^-^i  -  ^jtr/^^^)  ^^9?''  +  (%7^^i  +  ^^r/^^i)  ^/.9^'  +  /^  •  c.  (28) 

The  |Zl  / 1  =  J  rule  is  again  seen  to  hold  good.  Also  this  gives  the  F  —  A  inter- 
action for  A  decay  and  for  the  n^  decay  mode  of  i7+  while  the  7r+  decay  mode  of 
E^  goes  through  V  alone  and  27~  decay  goes  through  A  alone  •'^. 

Now  the  current  generated  by  rotations  associated  with  parameters  /^  is  a 
mixture  of  vector  and  axial  vector  currents  and  reads 

h  =  n>y,^-^rrf  +  ^TT{xHd,H-),  (29) 

where  we  have  replaced  0  by  a  more  general  quantity  H.  The  only  restrictions 
on  it  are  that  it  is  unitary  and  that 

^(0)  =  ^'(0)  =1,     H^  a((p^)  +  6(^2)  i^  .  (p  (30) 

The  current  (29)  is  invariant  under  v  rotations.  (29)  may  be  written  as 

(31) 


vrArsT^^+^T^A^ 


jl  is  the  isotopic  spin  current  and  jf  the  current  conserved  in  the  pseudo vector 
coupling  theory.  While  jf  is  conserved  only  in  the  limit  of  vanishing  pion  mass, 
jX  is  conserved  even  if  ^  =|=  0,  provided  the  p  —  n  mass  difference  is  neglected. 
Only  a  charged  leptonic  current  l^  seems  to  enter  weak  interactions.  If  we  assume 
that  V^  transforms  like  jl  under  rotations  round  the  third  axis  in  '/^-space  a  weak 
Lagrangian  invariant  under  this  subgroup  is  obtained  by  taking 

L„  =  2-*/.  G  ijt  +  n)  m  +  ItP  (32) 

with 

G^lO-'m-^       jt=nyA^  +  75)  P',       it=  (^  +  /^)  yA^  +  y5)y 


1  An  attempt  has  been  made  by  A.  Ramakrishnan,  A.  P.  Balachandran  and  N.  G. 
Deshpande  (to  be  pubHshed)  to  determine  the  structure  and  parity  of  weak  interaction 
currents.  It  is  found  that  the  current-current  picture  of  weak  interactions  is  not  valid,  i.e. 
A  Q/A  S  =  —I  leptonic  decays  are  allowed.  The  current  is  F  —  ^  whenever  A,  p  or  S~  occurs 
(for  both  leptonic  or  non-leptonic  decay  mode).  The  27-27  current  is  a  pure  vector  and  all  other 
baryonic  currents  are  either  F  or  ^. 

The  suggestion  that  the  observed  27-decay  asymmetries  arise  from  decay  diagrams  involving 
A  and  S  has  been  made  by  A.  P.  Balachandran  and  K.  Venkatesan.  They  use  the  experi- 
mental result  (Fowler  et  al.,  Phys.  Rev,  Lett.  6,  134  (1961))  that  the  asymmetry  parameters  (xa 
and  (Xs  have  opposite  signs.  See  A.  P.  Balachandran  and  K.  Venkatesan,  Progr.  Theor. 
Phys.  26-5  (L),  792  (1961). 
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It  is  to  be  noted  that  L^y  contains  a  direct  pion-lepton  interaction  with  a  coupling 
constant  ^26^/2/  corresponding  to  the  limit  // ->  0.  When  the  finite  pion  mass 
is  taken  into  account,  this  will  be  a  function  of  jj,^,  say  c(^*)  with 

c(^»)  3  cm  =  V2G/2/  (33) 

if  c(//'^)  is  a  slowly  varying  function  of  fj,^.  In  this  case  c(0)  may  be  regarded 
as  a  good  approximation  to  the  actual  :r-lepton  coupling  constant.  This  is  just 
the  Goldberger-Treiman  estimate^  based  on  dispersion  relation  techniques. 
As  long  as  ju  can  be  neglected,  the  vector  and  axial  vector  renormalizations 
should  be  the  samq,  namely  unity,  since  otherwise  j;i  will  not  be  a  vector  under  fx 

rotations.  The  deviation  of  — ;= —  from  unity  is  then  ascribed  to  the  finite  pion 

mass  that  perturbs  the  4-dimensional  symmetry. 

The  form  factors  (and  consequently  the  ratio  of  the  renormalized  coupling 
constants)  occurring  in  weak  interactions  are  easily  calculable  in  the  pole  approxi- 
mation. In  general  we  have  to  calculate  the  matrix  element  of  the  current  operator 
between  one  nucleon  states,  pion  states  etc.  This  matrix  element  can  be  shown 
to  be  related  to  the  product  of  the  matrix  element  of  the  operator  between  vacuum 
and  all  the  allowed  intermediate  states,  and  the  transition  matrix  element  between 
these  intermediate  states  and  the  final  state  of  a  nucleon  and  antinucleon,  two 
pions  etc.  In  principle  the  conplete  set  of  intermediate  states  should  be  taken 
into  account.  But  in  the  pole  approximation  the  nearest  singularity  is  expected 
to  dominate.2 

Earlier  in  the  discussion  of  A  decay,  we  pointed  out  that  the  |^  / 1  =  ^Iz  rule 
can  be  built  into  the  Lagrangians  for  non-leptonic  decays  of  the  strange  particles 
provided  we  allow  neutral  currents.  The  chief  difficulty  in  such  an  introduction 
arises  from  the  neutral  lepton  currents  which  lead  to  many  unobserved  processes 
like  //^  ^  e"^  +  e"  +  e^  and  K^->e'^  +  e'.  Recently  neutral  baryon  currents  have  been 
included  in  theVeak  interaction  Lagrangian,  neutral  lepton  currents  however  being 
excluded.^  As  Lee  and  Yang*  have  shown,  the  present  experimental  evidence  is 
definitely  against  the  existence  of  the  neutral  lepton  currents.  Of  course  such  a  view 
of  the  structure  of  weak  interaction  goes  against  the  notion  of  the  universal 
Fermi  interaction. 

With  the  introduction  of  the  neutral  currents,  the  jg(.  current  will  form  an 
isotopic  vector  while  the  ^^^^  current  will  now  form  an  isotopic  spinor.  The  latter 


^  M.  L.  GoLDBERGER  and  S.  B.  Treiman,  loc.  cit. 

2  J.  Bernstein,  S.  Fubini,  M.  Gell-Mann  and  W.  Thirring  {Nuovo  Cim.  17,  757  (I960)) 
have  studied  the  pion  decay  problem  using  this  approximation  and  are  able  to  arrive  at  the 
Goldberger-Treiman  formula  for  pion  decay  (M.  L.  Goldberger  and  S.  B.  Treiman,  Phys. 
Rev.  110,  1478  (1958)). 

3  S.  B.  Treiman,  Nuovo  Cim.  15,  916  (1960). 

*  T.  D.  Lee  and  C.  N.  Yang,  "Implications  of  the  Intermediate  Boson  basis  of  the  Weak 
Interactions:  The  Existence  of  a  Quartet  of  Intermediate  Bosons  and  Their  Dual  Isotopic 
Spin  Transformation  Properties",  preprint,  Institute  for  Advanced  Study  (1960). 
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AS 
will   also   satisfy  the   rule     a^=^  +1-   The  jg^   will   contain  the  terms  NrN, 

SrS,  S '  A,  and  2J  x  S.  The  ;V»c  will  consist  of  terms  like 


s_ 

Ap 

'S-A' 

[p8,\ 

' 

—  An  _ 

' 

S^A 

' 

pZ-n-{-i:^p 
Z^n-pU^p 


p  S°  E^  -  S-  E^ 
5'o  E^  +  p  S-  E- 


(34) 


where  S~  and  Sq  satisfy  AQ  =  — 1  and  AQ  =  0.  If  we  now  write  the  total  weak 
interaction  as 

Hr.t  =  j,S_  +  j,S,  +  h-c,  (35) 

where  j+  and  Jq  are  the  charged  and  neutral  parts  of  the  current  jgc;  H^^  trans- 
forms as  an  isotopic  spinor. 

While  the  Universal  Fermi  Interaction  is  able  to  explain  most  of  the  known 
phenomena  relating  to  weak  interactions,  it  is  quite  possible  that  it  may  be 
inadequate  at  high  energies  and  large  momentum  transfers.  This  led  Lee  and 
Yangi  to  speculate  upon  the  possibility  that  weak  interactions  are  transmitted 
by  an  intermediate  vector  boson  field.  In  other  words  the  four  fermion  vertex 
spHts  up  into  two  vertices,  each  vertex  involving  two  fermions  and  the  boson  W 
which  connects  the  vertices.  The  splitting  of  the  vertices  limounts  to  introducing 
non-locahty  into  the  four  fermion  interaction. 

Detailed  considerations  led  Lee  and  Yang  to  postulate  that : 

(a)  There  must  exist  a  TT^,  a  W^,  a  W^  and  a  W', 

(b)  Their  spin  is  unity  in  order  to  transmit  the  observed  vector  and  axial 
vector  form  of  weak  interactions, 

(c)  Their  mass  must  be  >m^  in  order  to  explain  the  absence  of  K  ->W  +  y. 
Among  the  predicted  properties  are : 

(a)  The  non-locaUty  introduced  by  W  causes  the  Michel  parameter  g  to  deviate 

from  3/4  by  the  factor  j  \^^j  . 

(b)  In  the  non-local  interaction  /j,^  ^  e+  +  v  +  v,  by  coupling  the  vertices 
/^  -f  ?;->  IT"  and  W'  +  r->e",  which  implies  in  turn  that  the  neutrino  emitted 
in  the  first  vertex  is  absorbed  in  the  second,  the  decay//"  ->e~  +y  will  be  allowed. 
Since  this  decay  is  known  to  be  forbidden  in  nature  the  intermediate  boson  hypo- 
thesis can  be  saved  only  if  it  is  postulated  that  the  neutrinos  coupled  to  the  elec- 
tron and  muon  are  distinct,  so  that  the  absorption  of  the  emitted  neutrino  is  not 
possible.  Divergence  difficulties  also  arise  since  the  theory  of  vector  mesons 
interacting  with  photons  is  not  renormalizable. 

(c)  W^  can  decay  rapidly  (<  lO'^^  sec)  into  pions,  leptons  or  iT- mesons.  Their 
coupling  with  the  leptons  I  is  given  by: 

ifwyA^+y5)WvW;,-^h-c.  (36) 

1  See  T.  D.  Lee,  Proceedings  of  the  Tenth  Annual  Conference  on  High  Energy  Physics 
(Rochester),  pp.  567  et  seq.  T.  D.  Lee  and  C.  N.  Yang,  loc.  cit. 
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where  the  coupling  constant  /  is  given  by 


m?,-       _  10" 


O" 


Gv=-TJ^    -^  (37) 


1/2      '  ~    1/2 

I/ce  and  Yang  have  also  discussed  the  production  cross-section  for  W^  through 
the  reaction  v -\- Fe ->  W ^  +  ju-  +  Fe  and  estimated  it  to  be  '^lO-^s  cm^.  This 
coherent  production  will  dominate  at  low  momentum  transfer  to  the  nucleus, 
i.e.  at  high  energies.  The  spin  and  polarization  of  W^  can  be  determined  by  a 
study  of  its  decay. 

If  we  admit  the  validity  of  the  Zl/  =  ^/j  postulate  for  weak  interactions,  we 
are  constrained  to  postulate  the  existence  of  two  more  neutral  bosons  W^  and 
W^.  The  interactions  between  W  and  strongly  interacting  particles  should  conserve 
isotopic  spin.  For  such  a  conservation  in  the  reaction 

Ao:^P+W'  (38) 

the  W~  particles  must  be  assumed  to  have  I^7  =  ^/g.  Thus  the  W's  form  two  doub- 
lets like  the  iT-mesons.  However  in  the  reaction 

n:;tp+W-  (39) 

the  W  particles  must  have  integral  isotopic  spin.  They  are  thus  grouped  into  a 

triplet 

W\W^a,W'     with     Iw=l 

and  a  singlet 

^  Wt     with     lw  =  0  (40) 

where 

and 

m==y^(WO-WO).  (41) 

In  view  of  this  dual-isotopic  spin  property  thej^  are  called  schizons.  The  schizon 
can  be  considered  established  only  if  it  is  possible  to  devise  two  experiments  other 
than  the  dissociation  of  A  and  n  requiring  the  alternate  assignments  of  isotopic 
spin.  The  experiments  suggested  are : 

(1)  Decay  of  W^ 

B{W^-^K'^  +  7i9)  =  iB{W^->K^  +  7i'-)  (42) 

and  W^  ^  K^  +  n^ 

corresponding  to  ^y  =  ^/g,  and 

(2)  R{W^ ->  271^ +  71-)  =  R{W^  ^271^ +  71^)  (43) 

due  to  I^y  =  ^/g. 

More  difficult  experiments  involving  neutrino  capture  have  also  been  suggested. 


CHAPTER  VIII 

STRONG  INTERACTIONS  AND 
STRANGE  PARTICLES 

1.  K  PARTICLE  INTERACTIONS^ 

We  will  now  present  the  salient  features  of  the  experimental  work  on  the  inter- 
action of  K^  and  K~  with  nucleons  and  nuclei.  The  theoretical  implications 
of  these  interactions  especially  relating  to  the  determination  of  the  relative 
parities  of  strange  particles  will  be  discussed  later.  We  have  already  referred  to 
the  interesting  phenomena  due  to  the  interaction  of  K^  and  K^  with  nuclei. 
As  a  consequence  of  strangeness  conservation,  K^  interactions  with  nuclei  cannot 
lead  to  hyperon  production  while  K~  interactions  can,  a  theoretical  prediction 
confirmed  by  experiment. 

K+-interactions 

Scattering  of  K+  by  free  nucleons  can  be  described  by  the  following  processes : 

K+  +  p->  K+  +  "p  (direct  scattering)  (1) 

K-^  +  n-^  K+  +  n  (direct  scattering)  (2) 

-^  K^  +  p  (charge  exchange).  (3) 

The  first  two  reactions  are  the  direct  scattering  of  K^  by  protons  and  neutrons 
while  the  last  refers  to  the  charge  exchange  scattering  of  K^  by  neutrons.  K^  +  p 
reactions  can  be  studied  directly  in  the  hydrogen  bubble  chamber.  However, 
the  features  of  the  K+  +  n  scattering  have  to  be  deduced  by  combining  the 
knowledge  of  K^  +  p  and  iT^-nucleus  scattering,  e.g.  K^  interactions  with 
deuterium  (bubble  chamber)  which  is  the  simplest  nucleus  or  with  more  complex 
nuclei  (emulsions).  In  studying  these  K^  scattering  processes,  it  has  been  found 
convenient  to  classify  the  events  into  elastic  and  inelastic  events.  An  elastic  K^ 
scattering  event  is  defined  as  an  event  in  which  there  is  no  visible  excitation  of 
the  nucleon  or  nuclei  (emulsion)  and  consequently  there  is  no  loss  of  energy. 
Actually  it  is  very  difficult  to  establish  an  energy  loss  less  than  5  MeV  from 
measurements  on  K  particles  alone.  These  elastic  events  are  mostly  due  to  the 
Coulomb-scattering  especially  for  small  angles  of  deflection  while  the  large  angle 
scatterings  are  due  to  K^  nuclear  interactions.  By  inelastic  events  we  mean  those 

^  See  M.  F.  Kaplon,  International,  Conference  on  High  Energy  Physics  at  CERN,  Edited  by 
B.  Ferretti  p.  171  (1958). 
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in  which  K  particles  sufiFer  energy  loss  and  also  those  in  which  new  particles 
are  produced. 

In  K*  ^  p  scatt<?ring,  it  has  been  found  that  the  total  cross-section  remains 
constant  at  15  mb  and  the  angular  distribution  is  consistent  with  isotropy. 
Such  an  isotropy  implies  an  >S-wave  interaction  in  the  /  ^  1  channel  since  the 
ir+  +  />  system  can  exist  only  in  the  7=1  state. 
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Fig.  38.  Collected  emulsion  data  on  iC+  scattering  by  nucleons;  corrected  cross-sections. 
The  graph  shows  the  total  iT^— niicleon  cross-section  (full  line)  and  the  non-charge  ex- 
change cross-section  (interrupted  line)  (CERN). 


Study  of  the  inelastic  events  in  K^  +  p  scattering  reveals  that  the  relative 

ATk 
kinetic  energy  loss  —=^ —  {Tj^  is  the  kinetic  energy  of  K^  and  A  T^  is  the  loss  in 

-'  K 

kinetic  energy)  is  small  and  that  this  loss  is  even  smaller  at  lower  T^ .  Further  the 
inelastic  events  are  strongly  peaked  backwards  (see  Fig.  39),  a  fact  which  strongly 
suggests  a  repulsive  7iL'''-nucleon  potential.  P-wave  contribution  in  JT^-nucleus 
interaction  has  also  been  observed.  The  total  cross-section  on  heavy  pure  elements 
indicates  that  the  cross- section  per  nucleon  may  decrease  as  the  mass  number  A 
increases. 

Measurements  on  total  cross- sections  which  are  a  sum  of  the  contributions 
from  both  direct  and  charge  exchange  scatterings  of  K^  —  n  have  been  made  by 
attentuation  experiments  in  water  and  heavy  water  the  results  of  which  are 
plotted  opposite. 
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Fig.  39.  Inelastic  scattering  of  K^  mesons  by  emulsion  nuclei.  The  curve  shows  the 
expected  relation  (in  the  laboratory  system)  between  inelasticity Zl  TkJTk  and  0  (CERN). 
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Fig.  40  (a)  Graph  showing  deduced  ir+— neutron  cross-sections;  non-charge  exchange 
(full  line)  and  charge  exchange  (interrupted  line)  (CERN). 
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Since  the  K^  —  n  charge  exchange  scattering  cross-section  is  known  to  rise 
with  energy,  it  is  difficult  to  derive  the  K^  —  n  elastic  scattering  cross-section 
from  this.  However  as  noted  above,  K^  —  n  cross-sections  have  been  obtained 
by  a  detailed  analysis  of  ^T^-nucleus  inelastic  scattering  events  with  the  help 
of  the  known  K^  —  p  scattering  results.  K^  —  n  elastic  scattering  rises  to  a 
maximum  of  10  mb  at  about  200  MeV  and  falls  off  at  higher  energies.  The  ^+  —  n 
charge  exchange  scattering  cross-sections  have  been  obtained  through  counter 
experiments  by  subtraction  of  the  data  from  the  interactions  of  Z+  with  CD2 
and  CH2 .  Figure  (40  b)  below  gives  the  energy  dependence  of  the  charge  exchange 
scattering  as  well  as  the  ratio  of  the  charge  exchange  to  total  cross  sections  for 
K^  +  n  scattering. 
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Fig.  40  (b)  Energy  dependence  of  charge  exchange  to  total  cross-sections  for  K^-n 

scattering. 


The  striking  feature  of  K^  —  n  charge  exchange  cross-sections  is  its  rise  with 
energy  and  its  rf  dependence  where  y]  is  the  centre -of- mass  momentum.  The  latter 
result  implies  a  P-wave  contribution  to  the  K^  —  n  interaction.  It  is  interesting 
to  note  that  the  enhancement  in  the  K'^  —  n  charge  exchange  cross-sections  sets 
in  at  about  the  threshold  for  tt  production  in  K  —  N  collisions  (^^220  MeV  for 
the  scattering  of  a  K^  meson  on  a  free  nucleon).  Thus  from  this  brief  survey  of 
experimental  results  oiK'^  —  n  data,  we  find  that  at  low  energies  the  7=1  channel 
is  dominant  and  only  /S-wave  scattering  is  present.  However  at  higher  energies 
the  7=0  channel  becomes  important  but  the  8  wave  interaction  is  relatively  weak 
in  that  channel.  A  P  wave  interaction  is  required  by  the  backward  peaking  of 
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the  K+  —  n  cross-section.  The  decrease  in  the  K+  —  n  elastic  cross-section  can 
be  understood  by  an  interference  of  the  P  waves  in  the  /  =  1  and  7=0  channels.^ 


K  -interactions 


Since  the  I^  values  for  K'  and  p  are  respectively  —J  and  J,  the  scattering 
of  K-  by  protons  can  proceed  through  both  7=0  and  /  ==  1  channels.  We  have 


i^~  -r  p->  K-  +  p  (direct  scattering) 
K-  +  p-^  K^  +  n  (charge  exchange) 
K-  +  ^  ->-  Y  +  n       (inelastic  or  capture) . 


(4) 
(5) 
(6) 


However  the  interactions  oi  K-  with  neutrons  proceed  through  the  7=1  channel 
alone.  Capture  events  indicate  that  K-  is  absorbed  by  the  nucleon  after 
K~  comes  to  rest,  while  inelastic  events  refer  to  absorption  of  K'  of  non-zero 
momentum. 

K-  +  n -^  K~  +  n     (elastic)  (7) 

K-  -\-  n-^  Y  -\-  n       (inelastic  or  capture) .  (8) 

K-  +  p  elastic  scattering  has  been  studied  both  in  the  hydrogen  bubble  chamber 
and  in  emulsion.  The  angular  distribution  of  elastic  scattering  events  is  isotropic. 
The  K~  +  p  elastic  cross-section  rises  to  about  90  mb  at  about  120  MeV/c  and 
then  drops  to  about  40  mb. 
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Fig.  41.  Energy  dependence  of  K  -p  elastic  scattering  cross -section. 


^  See  R.  H.  Dalitz,  International  Conference  on  High  Energy  Physics  at  CERN,  Edited  by 
B.  Ferretti  p.  187  (1958). 
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Since  it  has  been  established  that  K^  is  heavier  than  K'  by  about  3.9  MeV,* 
the  charge  exchange  scattering  of  K~  by  protons  has  a  threshold  of  89  MeV/c. 
This  has  been  established  by  observing  the  charged  decay  mode  of  the  short 
lived  neutral  K^,  i.e.  the  mode  if  J  ->  tt^  +  7t~  which  accounts  for  34  per  cent  of 
the  decays  ofK^.  The  energy  dependence  of  the  cross-section  is  shown  below.  Both 
the  direct  and  charge  exchange  scattering  of  K~  by  protons  seem  to  fit  well  with 
a  zero  effective  range  approximation  assuming  only  S  wave  interaction.^ 
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Fig.  42.  Combined  emulsion  data  (75  events)  for  the  reactions  K~ -{•  p -*- Z^  +  7i^ 

(CERN). 


The  data  on  K-p  inelastic  interactions  are  given  in  Fig.  42.  At  high  momenta^ 
the  Z'^  production  cross-section  is  about  12  mb  and  is  sbghtly  larger  than  the 
production  cross-section  of  Z~  which  is  about  8  mb.  However  at  low  energies 
the  cross-section  of  Z'  increases  much  more  rapidly  than  that  of  27+.  It  is 
interesting  to  note  that  at  K~  momentum  of  100  MeV/c  the  Z'/Z'^  production 
ratio  is  <  1  while  for  K~  captiu'e  at  rest  it  is  2.  At  higher  energies  the  ratio  becomes 
unity.  The  emulsion  data  on  Z^  production  in  jfii"  -  ^  interactions  in  flight  do 
not  seem  to  support  this  rapid  variation  of  the  Z~IZ'^  ratio.  The  angular  distri- 
bution of  i7~  and  Z'^  produced  is  consistent  with  isotropy.  Dalitz^  has  analysed 


^  A.  H.  RosENFELD  et  al.  using  charge  exchange  scattering  of  K~  in  hydrogen  bubble 
chamber  find  that  K^  is  heavier  than  K~  by  about  3.70  —  0.07  MeV.  See  Phys.  Rev.  Lett. 
2,  110  (1959).  Crawford  et  al.  by  studying  associated  production  in  7i~-p  collisions  have 
shown  that  K^  is  heavier  than  K^  by  about  5  MeV.  See  Phys.  Rev.  Lett.  2,  112  (1959); 
Ceolin  et  al,  Nuovo  Cim.  13,  818  (1959). 

2  J.  D.  Jackson,  D.  G.  Ravenhall  and  H.  W.  Wyld,  Nuovo  Cim.  9,  834  (1958).  See  also 
R.  H.  Dalitz,  loc.  cit.,  R.  H.  Dalitz  and  S.  F.  Than,  Ann.  Phys.  8,  100  (1959). 

3  R.  H.  Dalitz,  loc.  cit.  Also  R.  H.  Dalitz  and  S.  F.  Tuan,  loc.  cit. 
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these  inelastic  events  in  terms  of  complex  phase  shifts,  again  assuming  an  S-wave 
interaction  and  zero  effective  range.  In  the  high  energy  ^~-^  interactions  an 
event  giving  rise  to  the  S  particle  has  been  observed.  The  process  is 


K 


p-^^ 


K^  +  7t+. 


(9) 


In  ail  attempt  to  observe  the  cascade  particle,  K~  interactions  with  nuclei  in 
flight  have  been  studied  and  the  following  processes  observed: 


K-  +  N  ^  K-  +  N  +  71  -\-  71. 


(10) 

(11) 

(12) 


The  energy  spectrum  of  the  inelastically  scattered  K~  ranges  from  6  to  270  MeV 
which  is  possible  because  of  the  frequent  productiom  of  7i  mesons. 

As  in  the  case  of  iL+-nucleus  interactions,  the  iC "-nucleus  data  can  be  used 
to  yield  information  on  the  iL""-nucleon  potential.  To  this  end,  optical  model 
analyses  of  ^"-nucleus  data  have  been  carried  out.  The  imaginary  part  of  the 
potential  is  found  to  be  about  10  MeV  and  the  real  part  negative  indicating  an 
attractive   potential.  Further  a  study  of  the  inelastic  K~  -  p  events  reveals  a 

ATk- 
relatively  larger  kinetic  energy  loss  with  — ^ —  =-  0.45,  and  angular  distribution 

^  E 
which  is   peaked  forward.   These  two  strongly  suggest  ari  attractive  K~ -nuclear 
potential. 

AppUcation  of  the  principle  of  charge  independence  to  the  K~  -  p  or  K~  -n 
capture  reactions  leads  to  interesting  relations  among  cross-sections  for  the 
production  of  charged  and  neutral  27  particles.  Of  particular  interest  is  the  pro- 
duction of  neutral  and  charged  hyperons  (27  and  A)  in  the  capture  of  K~  -p 
by  nucleons.  The  K-p  capture  reactions  are: 


K-  +  ^->/to+  n^ 

->i7-+7r+ 
->i7o+7rO 

->E^  +  7z-. 

Denoting  the  reaction  amplitude  by  a^  and  a^  for  production  of  i7's  in  the  / 
and  /  =  1  channels,  we  obtain  the  relation 


(E''7t-):{E0  7^):(S-n*)  =  \ 


1/3        1/2 


6 


1/3 


1/2 


(13) 

(14) 
(15) 
(16) 

0 

(17) 


Thus  by  comparison  with  experimental  data,  we  can  only  deduce  the  relative 
phase  between  a^  and  a-^ .  Since  A^  has  7  =  0,  the  A^  +  7i^  system  can  exist  only  in 
the  7=1  state.  Thus  if  we  denote  by  h,  the  corresponding  production  amplitude 
19* 
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the  A^  +  Tif^  production  cross-section  is  proportional  to  .  From  K~  captures 
in  hydrogen,  it  has  been  found  that 

Z-:2:-':2:^:A^  =  8:4:4:1.  (18) 

To  fit  with  data,  the  phase  between  Uq  and  a^  is  required  to  be  108°.  This  ratio, 
as  we  shall  see  later,  is  inconsistent  with  the  prediction  by  the  "global  symmetry" 
model  of  strong  interactions  due  to  Grell-Mann.  Besides  the  above  reactions,  there 
also  exist  two  reactions 

K-  +  p^A^  +  71+  +  71- 

-^A^  +  7Z^+  7t^.  (19) 

A  study  of  these  can  throw  light  on  the  relative  K  —  A^  parity  (see  chapter  VII.3). 
The  capture  of  K-  by  neutrons  can  lead  to  the  following: 

K-  +  71^1-  +  71^  (20) 

->i:^+  7t-  (21) 

->A^+7t-  (22) 

^A^-i-  71-  +  71^.  (23) 

The   last  reaction   has   been   observed.   The  R-  —  n  interactions  can  proceed 

only  through  the  /  =  1  channel.  We  thus  find  the  ratio -y^  for  K'  -n  capture  to 

be  1.  A  more  interesting  relation  is  obtained  if  we  consider  the  ^"capture  by 
a  target  containing  an  equal  number  of  protons  and  neutrons,  i.e.  deuterium  and 
hehum.  By  adding  the  K'  -p  and  R-  -n  production  amplitudes,   we   obtain 

N(Z-^)  +  N{E-)  =  2N(E^)  (24) 

where  N(Z)  denotes  the  number  of  hyperons  produced.  This  relation  seems  to 
be  confirmed  by  the  R-  -  d  capture  reactions  observed  in  the  deuterium  bubble 
chamber.  The  data  from  the  deuterium  bubble  chamber  are 

N(E-^  +  E-)^  160  4-8 

(25) 
N(Z^)  =.    86±  6 

Another  feature  of  the  R'-d  capture  is  that  the  Z"  :  27+  :  27°  :  A^  value  is 
strikingly  different  from  that  in  hydrogen^  (see  table  below).  In  view  of  the  fact 
calculations  based  on  impulse  approximation  agree  with  the  experimental  data, 
this  difference  seems  to  indicate  that  the  interaction  amplitudes  are  momentum 
dependent,  i.e.  the  interaction  amplitudes  are  influenced  by  the  Fermi  motion 
within  the  nucleus. 

Capture  of  R-  by  nuclei  has  been  extensively  studied  in  emulsion  nuclei  and 
the  data  have  been  given  a  phenomenological  interpretation.  One  of  the  main 


1  R.  D.  Tripp,  Int.  Conf.  on  High  Energy  Physics  at  CERN  p.  184  (1958). 
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difficulties  in  the  emulsion  work  is  that  the  neutral  particles  cannot  be  detected 
and  hence  the  reduction  of  data  requires  a  complicated  correction  procedure  to 
infer  the  number  of  X^,  A^  and  n^  emitted.  There  is  clear  evidence  for  a  multi- 
nucleon  interaction  of  K~  from  the  ^"-nucleus  capture  data.  This  is  in  contrast 
to  the  usual  assumption  that  Kis  captured  by  an  individual  nucleon  inside  the 
nucleus.  If  hyperon  production  proceeds  through  a  single  nucleon  interaction, 
then  the  emitted  H  hj^erons  can  have  a  maximum  energy  of  60  MeV  corre- 
sponding to  an  interaction  with  a  single  nucleon  of  Fermi  momentum  of  about 
240  MeV/c.  However  several  high  energy  hyperons  have  been  observed.  According 
to  the  data,  about  40  per  cent  of  the  hyperons  produced  are  due  to  multi- 
nucleon  interactions.  The  multi-nucleon  interactions  are 

K-  +  p  +  p-^Z+  +  n  (26) 

-^Z^+p  (27) 

->A^  +  p  (28) 

K-  +  p+n^Z-  +  p  (29) 

->i:«  +  w  (30) 

->A^  +  n  (31) 

K-  +  n  +  n-^H-  ^  n.  (32) 

Another  important  feature  of  the  multi-nucleon  interaction  is  that  the  hyperons 
are  unaccompanied  by  7i  mesons.^ 

The  iL~-nucleus  data  yield  some  information  on  the  site  of  the  capture.  It  is 
found  that  in  -K" "-capture  at  rest  there  is  relatively  little  absorption  of  the 
accompanying  7r-meson  while  in  interactions  in  flight  the  number  of  tt's  that 
emerge  from  the  nucleus  is  greatly  reduced.  The  implication  is  that  in  interactions 
in  flight,  the  hyperon  production  takes  place  much  deeper  inside  the  nucleus  and 
thus  there  is  a  greater  chance  of  the  tz  being  absorbed,  while  Jr~-capture 
at  rest  is  more  of  a  surface  phenomenon.  It  is  also  found  that  there  is  less  of 
multi-nucleon  interaction  in  captures  in  flight.  These  facts  can  be  used  to  con- 
clude that  the  correlated  pairs  of  nucleons  occur  much  more  frequently  at  the 
surface  of  the  nucleus.  In  the  table  below  are  given  the  various  percentages  for 
the  production  of  hyperons  through  various  modes  in  emulsion,  Dg  and  Hg. 
It  is  assumed  that  emulsion  nuclei  contain  an  equal  number  of  protons  and 
neutrons.  The  K^  +  n  data  in  H  are  computed  using  the  charge  independence 
hypothesis.  In  emulsion  data,  it  is  assumed  that  the  multi-nucleon  processes  are 
responsible  for  30  per  cent  of  the  hyperons.  It  is  obvious  from  the  table  that 
the  data  from  elementary  K~  -  p  and  K~  -  n  interactions  from  H  do  not  hold 
for  the  Dg  ^^^  emulsion  nuclei. 


^  "X~  collaboration  experiments",  Parts  I  and  II,  Nuovo  Cim.  13,  670  (1959),  14,  315 
(1959). 
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Hyperon  production  by  K 

-■capture.^ 

Interaction  process 

Hydrogen 
bub.  oh. 

Deuterium 
bub.  oh. 

Emulsion 

i:-  +  p->2:-  +  jr+ 

29.5 

24.5 

12.0 

-^£■'  +  71- 

14.8 

21J6 

21.5 

-►  2;»  +  ;r« 

14.8 

20.8 

12.4 

->/l«  +:7iO 

3.7 

8.0 

12.1 

A'-  4  n  ->  Z-  +  7r° 

(14.8) 

4.6 

8.9 

-^U^  +n- 

(14.8) 

4.6 

8.9 

->ylo  +n- 

(7.4) 

16.0 

24.2 

-^n-jn^  ratio 

1.76  ±  1.0 

1.75  ±0.3 

4.5  ±  0.9 

E-IE*  ratio 

3.2  ±1.3 

0.82  ±  0.2 

0.97  ±  0.3 

It  is  also  noticed  that  the  n"  jn'^  ratio  of  4.5  from  jfiT-captures  at  rest  is  very 
large  compared  to  the  value  of  1.5  obtained  assuming  charge  independence. 
Another  interesting  feature  in  the  ^--captures  by  nuclei  and  Dg  is  the  enhance- 
ment of  yl^  production  which  requires  the  charge  exchange  scattering  of  hyperons 
as  we  shall  see  below. 


2.  HYPERONS  AND  HYPERFRAGMENTS 

Production  of  hyperons 

The  associated  production  of  hyperons  and  ^-mesons  can  be  achieved  with 
71-N  and  N-N  collisions.  The  following  reactions  leading  to  the  production  of 
hyperons  have  been  studied  in  n-N  collisions : 

Threshold  momenta  MeV/c 

71-  ^-p^A^^  K"^  ^    900 

7i-  +  p^E-  +  K'^  ^1050 

7t^  +  p-^E^  +  K^  ^1040  (1) 

These  interactions  have  been  studied  in  the  range  930-1430  MeV/c  of  tt"  momen- 
tum. Many  events  with  A^  production  have  been  observed.  The  present  experi- 
mental results  are  as  follows :  ^ 

(1)  All  the  observed  total  cross- sections  lie  in  the  interval 

0.1  mb<or<  Imb. 

(2)  The  total  cross-section  for  A^  rises  from  the  threshold  to  a  peak  of  about 
0.8  mb  near  llOOMeV/c  and  the  drops  again  to  0.2— 0.3  mb  in  the  interval 

1  The  table  is  reproduced  from  a  report  prepared  by  A.  C.  Melisainos  on  behalf  of  the 
Rochester  group.  We  thank  Dr.  Sudarshan  for  making  available  this  report. 

2  Report  by  J.  Steinbergbr,  International  Conference  on  High  Energy  Physics,  GERN, 
Ed.  B.  Feretti,  p.  147  (1958). 
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1250-1450  MeV/c.  Similar  behaviour  has  not  been  observed  in  the  i7~  and  Z^ 
cross-sections. 

(3)  The  most  remarkable  result  concerns  the  angular  distributions.  It  has  been 
noticed  even  before  that  while  the  charged  hyper ons  are  peaked  forwards,  the  neutral 
hyper ons  are  always  peaked  backwards.  In  the  above  experiments  it  is  found 
that  the  angular  distribution  of  A^  does  not  change  in  the  interval  of 
1000-1450  MeV/c  for  7i~  momenta.  The  backward  peaking  of  the  Z^  is  less 
marked  compared  to  that  of  the  A^. 

A  very  interesting  method  to  study  the  spin  of  the  hyperon  is  due  to  Adair  ^ 
who  suggested  that  the  angular  distribution  of  the  decay  products  of  the  hyperons 
produced  in  the  forward  or  backward  directions  in  the  cm.  system  of  tt"  -  ^ 
will  depend  only  on  the  spin  of  the  A^.  An  assumption  essential  to  the  argument 
is  that  the  spin  of  the  K~  meson  is  zero.  It  is  easy  to  see  from  conservation  of 
J  and  Jz  that  for  production  angle  0  =  0  or  tt  in  the  cm.  system,  A^  will  be 
produced  with  m^  =  J  or  — J  with  equal  probability  and  there  will  be  no  phase 
correlation  if  we  perform  jt"  -  p  experiments  with  unpolarized  targets.  The 
component  of  the  angular  momentum  of  the  A^  vanishes  along  6  =  0  ot  n.  Thus 
for  spin  ^/g,  A^  will  be  produced  unpolarized  and  hence  the  angular  distribution 
of  its  decay  products  will  be  isotropic  If  A^  has  higher  spins  like  ^/g  or  ^j^  it  will 
be  produced  with  polarization  since  all  its  spin  states  are  not  equally  populated 
{A^  will  still  be  produced  only  in  the  m^  =  J  and  — J  states).  So  the  angular  distri- 
bution of  the  decay  products  will  be  anisotropic  and  the  form  for  spin  ^/g  will 
be  (1  +  3  cos2  0).  The  above  considerations  can  be  applied  for  0  =  0  or  tt  provided 
the  contribution  of  higher  angular  momentum  waves  (i.e.  rrii  4=  0)  can  be  assumed 
to  be  very  small.  Since  the  hyperons  are  produced  fortunately  with  forward  or 
backward  peaking,  recent  experiments^  indicate  that  hyperons  have  spin  ^/2. 

A  few  experiments  on  the  production  of  hyperons  by  3  BeV  protons  incident 
upon  nuclei  have  been  done.  The  hyperons  and  mesons  were  detected  by  observing 
the  y-rays  originating  from  regions  several  centimeters  down  stream  from  the 
target.  The  following  reactions  are  possible. 

p^  'p-^K+  +  A^  +  p 

^K-^  +  Z^  +  p  ■■  (2) 

->K^  +  1+  +  p  , 

-^K+  +  Z+  +  n  (not  observed)  (3) 

p+n-^K^  +  A^  +  p      1 

[  (4) 

->K^  +  i:^  +  p       \ 

-^K+  +  Z"  +  p  (not  observed)  (5) 

■^K+  +  A''  +  n' 

1  R.  Adair,  Phys.  Bzv.  100,  1540  (1955). 

2  F.  EiSLER  et  ah,  Nuovo  Cim.  7,  222  (1958). 
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Making  the  assumption  that  there  is  only  one  production  interaction  (i.e.  neglecting 
multiple  interactions)  it  was  estimated  that  the  cross-sections  are  40  ±  6  ^b, 
210  ±  30  and  420  ±  60//b  for  (2),  (4)  and  (6)  respectively,  i  The  results  seem 
to  show  that  the  cross-section  for  associated  production  in  p- p  collisions  is 
significantly  smaller  than  the  cross-section  in  p-n  collisions.  It  has  also  been 
found  that  the  cross-sections  for  the  production  of  2*+,  A^  or  K^  are  extremely 
small  for  p-  p  colhsions.* 

Hyperon  interactions 

Some  events  corresponding  to  the  interaction  of  hyperons  at  rest  and  in  flight 
have  been  observed.  Elastic,  inelastic  and  charge  exchange  scattering  events 
have  been  identified.  There  is  also  the  possibility  of  hjrperons  forming  bound 
states  with  nucleons,  i.e.  the  formation  of  hyperfragments  which  will  be  discussed 
presently. 

2J~  -  interactions 

Z~  capture  reactions  have  been   studied  in  the  hydrogen  bubble   chamber. 

The  reactions  are 

E-  +  p^A^-\-n  (7) 

E-  ^-p-^E^+n.  (8) 

E^ 
The  present  data  yield  a  ratio    yo  ,    iF  ^  ^•^^-  -^^  captures  in  D  can  proceed 

as  follows: 

Z-  +  D  ->  yio  +  w  +  w.  (9) 

E-  +  'D->E'^+  n+  n.  (10) 

This  yields  a  ratio  -y^ -r^  ^^  0.037  ±  0.022.^  Fortunately  it  is  easy  to  distingmsh 

the  A^  that  are  produced  directly  from  the  A^  that  arise  from  the  decay  of  thei7^ 
since  the  latter  A^'s  have  a  momentum  less  than  134  MeV/c.  The  difference  in  this 
ratio  for  H  and  D  can  be  explained  assuming  the  impulse  approximation  in  the 
E~-T>  interactions.  In  order  to  account  for  the  enhanced  A^  production  there  is 
no  need  to  invoke  two-step  E~-D  interactions  such  as 

E-  +  D--^E^+n  +  n->A^  +  n  +  n.  (11) 

However  there  is  strong  evidence  for  such  a  two-step  process  in  the  K~-D  capture. 
It  is  to  be  noted  that  this  two-step  process  is  essentially  a  charge  exchange 
scattering  of  hyperons.  As  for  E'-D  interactions,  it  is  quite  easy  to  distinguish 
the    directly    produced   A^    from    those    produced    in    the   decay    of   the    E^ 


1  D.  Berley  and  G.  B.  Colliers,  Phys.  Rev.  112,  614  (1958). 

2  Cool  et  al.,  Phys.  Rev.  108,  1048  (1957). 

3  O.  Dahl  et  al,  Phys.  Rev.  Lett.  4,  77  (1960). 
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in  the  K~-T>  capture.  Under  the  impulse  approximation  we  can  write  the  K~-D 
interaction  as 

K-  +  T>~^{K-+n)  +  p-^A^  +  71-  +  p^{i:^  +  71-)  +  p.  (12) 

Since  hyperon  production  is  a  two  body  reaction,  jt"  must  have  a  unique  momen- 
tum smeared  out  only  by  the  Fermi  motion  of  the  nucleon.  The  tt's  accompanying 
the  H  have  a  peak  at  about  182  MeV/c  indicating  the  validity  of  the  impulse 
approximation.  However  the  tc's  accompanjdng  the  A^  exhibit  two  peaks  one 
at  253  MeV/c  and  another  at  182  MeV/c.^  This  interesting  phenomenon  impHes 
that  some  of  the  tt"  accompanjdng  the  A^  were  partners  to  Z^  or  i7+  according 
to  the  following  reactions  which  proceed  in  two  steps. 

I         {K-  +  n)  +  p^  (Zo  +  71-)  +  p  (13) 

{K-  +  p)  +  n-^  {1+  +  n-)-^n  (14) 

II         K- +  I)-^I^+ 71- +  p-^A""  +  p+ 71-  (15) 

K    +  D->i;+  +  TT-  +  n->A'^  +  p+  71-  (16) 

The  last  step  refers  to  the  production  of  A^  by  charge  or  particle  exchange  of 
the  hyper ons. 

Very  few  clear  data  exist  on  27"  —  p  elastic  scattering  events.  It  has  been 
estimated  that  the  probability  of  escape  of  iT"*",  E-  and  Z,^  from  nuclei  is  45  per  cent 
corresponding  to  a  mean  free  path  of  0.5  x  10~^^  cm  in  nuclear  matter. 

The  following  events  concerning  A^  interactions  have  been  observed  which 
correspond  to  elastic  and  charge  exchange  scatterings: 

A^+p->A^  +  p  (17) 

/lo  +  :P^i:+  + w.  (18) 

The  second  reaction  is  exothermic  and  has  a  threshold  at  635  MeV/c. 

Hyperf  ragments  ^ 

We  have  seen  that  the  strange  particles  that  have  been  discovered  are  divided 
into  two  classes,  the  hjrperons  and  the  heavy  mesons.  The  hyperons  along  with 
the  neutron  and  the  proton  are  called  baryons  or  the  heavy  particles.  It  was  found 
interesting  to  investigate  the  possibility  of  the  existence  of  nuclei  in  which  some 
of  the  neutrons  were  replaced  by  hyperons.  Such  a  speculation  was  encouraged 
when  a  very  unusual  event  was  reported  by  Danysz  and  Pniewski^  in  which  a 
nuclear  fragment  from  a  cosmic  ray  star  stopped  in  the  emulsion  and  subsequently 
disintegrated  with  an  energy  release  of  about  140  to  180  MeV.  There  are  two 
possible  explanations  for  such  an  event.  (1)  The  fragment  might  have  contained 


1  A.  Fujii  and  R.  E.  Marshak,  'Nuom  Cim.  8,  643  (1958). 

2  Parts  of  ibhis  section  are  based  on  "Some  Properties  of  Uncommon  Forms  of  Matter",. 
Lectures  given  by  P.  Morrison  at  the  University  of  Alberta,  Edmonton,  Canada  (1957). 

3  M.  Danysz  and  J.  Pniewski,  Phil.  Mag.  44,  348  (1953). 
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a  jT"  in  its  orbit  and  after  the  fragment  stopped  it  could  have  been  absorbed 
with  consequent  release  of  energy.  (2)  The  fragment  might  have  contained  a  A^ 
which  decayed  after  the  fragment  had  stopped  with  a  corresponding  release  of 
energy.  Further  experimental  work  confirmed  the  second  possibility  and  for  such 
nuclei  the  name  "hyperfragment"  or  "  hypernucleus"  was  proposed  by  Goldhaber. 
Once  such  hyperfragments  were  observed  in  cosmic  ray  events,  experiments 
in  high  energy  accelerators  with  protons  of  energies  of  the  order  of  BeV  followed. 
The  frequency  of  occurrence  of  hjrperfragments  was  found  to  be  of  the  order  of 
1  in  1000  for  3  BeV  protons  and  1  in  500  for  1.5  BeV  mesons.  A  typical  cosmic 

^'5 


Fig.  43.  A  typical  high-energy  primary  particle  interaction. 

ray  event  involving  a  hyperfragment  can  be  described  as  follows.  A  fast  primary 
particle  enters  at  (1)  and  suffers  a  nuclear  collision  at  (2)  giving  rise  to  a  spray  of 
mesons  and  some  other  particles.  Among  these  is  one  which  makes  a  heavy  track  (3) 
and  has  a  short  range.  It  comes  to  rest  at  (4)  and  then  disintegrates.  The  track 
indicates  a  nuclear  mass  but  it  is  not  an  ordinary  nucleus  since  one  of  the  decay 
products  is  a  pion  (5).  Hence  we  conclude  that  the  track  is  probably  that  of  a 
hypernucleus.  The  observation  of  a  not  very  heavily  charged  recoiL  product 
together  with  a  proton  and  a  pion  and  an  energy  release  of  about  37  MeV  was 
interpreted  as  the  decay  of  a  yl^  in  the  nucleus. 

We  shall  here  consider  only  light  hyperfragments  containing  A  particles.  We 
should  except  the  simplest  hypernucleus  tohe  slA^  -r  p  system.  This  is  analogous 
to  the  deuteron  and  the  forces  involved  are  of  the  same  nature  and  approximately 
of  the  same  magnitude  as  the  nuclear  forces.  The  system  denoted  by  ^H^  if  it 
exists  should  be  easy  to  observe  having  a  track  corresponding  to  a  charge  Z=l 
(Fig.  44).  The  particle  would  come  to  rest  and  decay  into  two  protons  with  an 
energy  release  of  about  30  MeV.  Such  an  event  has  never  been  observed  and  hence 
it  is  concluded  that  ^H^  does  not  exist  or  that  the  {A^,  p)  system  has  no  bound 
state.  Several  events  of  the  type  shown  in  (Fig.  44)  have  been  observed  in  which 
all  the  particles  corresponding  to  the  observed  tracks  have  their  momenta  in  the 
forward  direction.  Since  the  initial  particle  comes  to  rest  before  decaying  there 
must  be  a  neutral  decay  particle  in  the  backward  direction  due  to  the  conservation 
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of  momentum.  Hence  the  initial  hypernucleus  must  be  yl^  +  ^  +  p,  i.e.  ^H^. 
In  a  similar  manner,  we  can  analyse  other  hjrpernuclei. 

Analysis  of  hjrpernuclei  consists  in  measuring  the  ranges  of  the  decay  products 
and  balancing  the  energy  and  momentum  assuming  one  or  two  recoil  neutrons. 
In  this  way  the  energy  of  the  decay  is  obtained  and  since  the  decay  products 
are  also  known,  the  binding  energies  can  be  calculated. 

The  majority  of  the  hypernuclei  that  have  been  studied  experimentally  so  far 
are  those  with  a  A^  replacing  a  nucleon.  We  know  that  the  A^  has  been  assigned 


z--\  y  z=i 


(B) 
Fig.  44.  yl-hyperfragment  decay. 


the  isotopic  spin  zero  since  it  has  no  charged  counterpart.  Its  spin  is  half  integral 
and  probably  equal  to  half.  We  assume  that  the  /t^-nucleon  forces  have  a  short 
range  and  are  charge  independent.  By  comparison  of  the  energy  levels  of  the  hghter 
hypernuclei  with  those  of  ordinary  nuclei  it  is  found  that  the  hypernuclei  can  be 
obtained  by  the  simple  addition  of  a  A^  to  the  appropriate  nucleus.  This  means 
that  the  A^  is  not  bound  to  the  nucleons  by  very  strong  forces  though  they  may 
be  of  the  order  of  nu clear  forces.  It  has  been  shown  by  Dalitz^  and  Downs  that  theyl- 
nucleon  forces  should  be  somewhat  weaker  than  the  nucleon-nucleon  forces  as  a  con- 
sequence of  charge  independence.  The  argument  is  as  follows :  If  theyl-nucleon  forces 
are  comparable  to  the  nucleon-nucleon  forces,  then,  since  the  Pauli  principle  will  not 
be  effective  for  a  A^  bound  to  a  nucleus,  the  binding  energy  B^i  of  the  A^  should 
be  quite  large  compared  to  the  corresponding  binding  energy  B^  of  the  neutron 
for  which  the  Pauli  principle  provides  an  additional  strong  repulsion.  But  actually 
Byi  <  B,^  and  hence  the  above  conclusion.  But  this  does  not  imply  that  the  A^- 
nucleon  coupling  is  weaker  than  the  nucleon-nucleon  coupling  since  yl^-nucleon 
forces  are  probably  of  a  shorter  range  and  therefore  less  effective  in  binding. 
It  is  also  found  that  the  binding  energy  tends  to  increase  with  the  mass  number. 
The  facts  that  the  hyperfragments  ^H*  and  ^He^  exist  and  that  the  binding 
energy  of  the  A^  particle  in  ^Be^  is  greater  than  the  binding  energy  of  the  last 
neutron  in  Be^  show  that  the  Pauli  principle  does  not  apply  for  a  yl^  in  a  nucleus. 


1  R.  H.  Dalitz  and  B.  Downs,  Phys.  Rev.  110,  958  (1958);  111,  967  (1958).  These  papers 
contain  references  to  earlier  works  on  hyperfragments.  See  also  S.  Iwao,  Progr.  Theoret.  Phys. 
Japan  13,  111  (1955). 
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The  low  binding  energies  of  the  light  hypernuclei  imply  that  the  A^  spends  much 
of  its  time  outside  the  nucleus  as  is  shown  by  the  fact  that  in  general  the  momenta 
of  the  A^  (computed  from  those  of  ;r~  and  p)  are  quite  low. 

Since  the  nucleon-nucleon  forces  are  spin  dependent  it  is  interesting  to  in- 
vestigate whether  such  spin  dependent  forces  exist  for  a  ^-nucleon  system.  If 
such  forces  were  to  exist,  then  the  /l-nucleon  system  will  be  in  the  triplet  or  the 
singlet  state  (assuming  the  spin  of  the  A^  to  be  half).  A  convenient  system  for 
such  an  analysis  is  the  hypernucleus  ^He*.  The  A  particle  is  loosely  bound  to 
the  stable  (\- particle  core.  Taking  up  the  shell  model  approach,  to  iind  the  inter- 
action potential  we  assume  that  the  total  potential  is  the  sum  of  the  average  of 
potentials  between  the  A^  particle  and  the  individual  nucleons.  We  then  have 

VAirA)  =  Sj  Va-n^Ta  -  r^)  Q{r^)  d«r^  (18) 

where  F^  _  j^  is  the  potential  between  the  A^  and  a  nucleon  and  q  (r^)  is  the  nuclear 
density.  Since  the  forces  are  assumed  to  be  charge  independent,  we  sum  over  all  the 
nucleons.  Assuming  the  range  of  the  ^"-nucleon  forces  to  be  so  small  that  we  can 
writ^ 

Va -KirA-  »-,v)  =  r^_t,HrA-  r^)  (19) 

we  have  ^ 

VAirA)  =  I]{VA-N)Q{rA)'  (20) 

N=l 

These  approxilnations  correspond  to  the  situation  where  the  yl-particle  is  mostly 
outside  the  nucleus  and  '  feels'  the  nuclear  forces  only  when  it  overlaps  the  nuclear 
matter. 

With  the  above  form  of  Va  and  an  appropriate  wave  function  for  A^,  we  can  esti- 
mate <T>  and  <F>,  the  mean  kinetic  and  potential  energies.  Using  the  zero  range 
function  of  the  deuteron  theory  we  can  match  the  observed  values  of  the  binding 

4 

energy  values  with  the  calculated  ones  so  that  we  obtain  a  value  for  u^  =  UVan-  ^Y 

1 
this  method,  Dalitz  and  Downs ^  have  obtained  u^  ^  660  MeV  /^  (/  =  1  x  1 0-^^  cm), 
Q  {^n)  being  assumed  from  experimental  results  on  the  electron  scattering  by  He* 
nuclei.  An  overall  error  of  ten  to  twenty  per  cent  was  expected  from  the  nature  of 
the  approximation  made  for  the  wave  function  of  ^He^.  In  this  case,  we  consider 
the  interaction  of  ^^  with  two  nucleons  having  spins  parallel  to  it  and  two  having 
spins  antiparallel  to  it.  If  the  spin  of  the  A^  is  half,  then  it  can  form  triplet  and 
singlet  states  with  a  nucleon  depending  on  its  spin  orientation,  i.e., 


u^  =  4:ijUt  +  ^UuA     and     i^«  = -^  =  165  MeV  f 
where  Uf  is  the  triplet  potential  and  Ug  the  singlet  potential. 


(21) 


1  R.  H.  Dalitz  and  B.  Downs,  loc.  cit. 
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Let  US  consider  the  hypernuclei  ^^e*  and  ^H*  corresponding  to  the  addition 
of  a  A^  to  He^  and  H^  nuclei  respectively.  Analyses  can  be  made  similar  to  that 
of  ^  He^  except  that  the  core  consists  of  a  coupled  pair  of  particles  of  opposite 
spins  and  an  extra  nucleon.  The  spin  of  the  A^  is  either  parallel  or  antiparallel 
to  that  of  the  extra  nucleon.  One  of  these  orientations  is  favoured  and  the  inter- 
-action  of  the  A.^  with  the  unpaired  nucleon  in  this  favoured  state  is 

^/  =  %  —  2  %  (22) 

where  u^  is  the  potential  of  interaction  of  the  A^  wdth  the  three  nucleons  and  ^u^ 
that  with  a  pair. 

Assuming  the  value  of  720  MeV  /^  for  u^ ,  we  obtain  Uf  ^  390  MeV  f.  The  large 
difference  between  u^  and  Uf  shows  that  the  A^  —  N  forces  are  spin-dependent. 
Similar  calculations  have  been  carried  out  for  ^Li'  gi"^ing  consistent  results. 
This  explains  also  why  ^H^  is  more  tightly  bound  than  ^He^.  Only  in  ^H^  can 
both  the  nucleons  have  spins  in  the  favoured  direction  with  respect  to  the  A^  spin. 
But  this  does  not  confirm  that  the  spin  of  the  A^  is  half  and  also  it  has  not  been 
possible  to  find  the  relative  directions  of  the  spins  of  the  A^  and  the  nucleons. 

To  determine  whether  the  favoured  orientation  for  a  A  nucleon  system  is  a 
singlet  or  a  triplet  state  we  consider  the  simplest  hypernucleus  ^H^.  The  proton- 
neutron  part  is  assumed  to  be  a  deuteron,  i.e.  in  the  triplet  state  and  the  trial  wave 
function  used  is 

i^  =  e-«(^i+^si)e-^^  (23) 

with  a  and  ^  as  variational  parameters  and  e"^^  the  deuteron  wave  function  in 
the  triplet  spin  state.  It  is  to  be  noted  that  we  have  been  able  to  assume  such 
a,  wave  function  since  the  A^  is  loosely  bound  to  the  other  two  nucleons.  This 
approximation  will  not  be  valid  for  a  three  nucleon  system  such  as  H^  or  He^. 
To  decide  between  the  two  spin  orientations  consider  the  cases  when: 

(A)  All  spins  are  parallel 

The  binding  potentials  should  satisfy  the  condition  u^  >  Ug  and  there  is  only 
one  potential  to  be  defined  from  the  experiment  since  all  pairs  are  in  the  triplet 
state.  We  have  essentially  an  effective  range  theory  as  in  the  S  wave  scattering 
in  low  energy  physics.  Solving  for  the  scattering  amplitude  with  different  potential 
shapes  and  ranges  the  variation  is  only  ±  10  per  cent.  The  effective  range  oct  is 
found  to  be  negative.  Hence  there  is  no  bound  state  for  the  two-body  system. 
The  results  for  thermal  scattering  cross-sections  (c^  ^^  1  barn)  are  consistent  with 
experimental  results.  For  a  Yukawa  well  the  best  range  is  ^hl^m^c  which  is 
consistent  with  the  idea  that  the  force  arises  from  the  exchange  of  two  pions. 

(B)  The  A-spin  is  opposite  to  the  deuteron  spin 

The  singlet  state  for  the  yl -nucleon  system  is  favoured  here  and  the  potential 
is  not  purely  Ug  since  the  deuteron  system  is  in  the  triplet  state.  Hence  there  is 
no  unique  solution  and  only  relative  magnitudes  can  be  found.  The  limiting 
factor  is  that  the  ^W^  is  unbound  and  hence  the  range  cannot  be  too  small. 
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Another  method  of  dealing  with  this  problem  is  to  write  down  relations  between 
the  total  volume  integrals  and  the  average  parameters  of  two  body  interactions 
for  the  hyperfragments  ^H*,  ^H**,  ^He**  and  ^He**  for  both  parallel  as  well  as 
anti-parallel  spin  orientations.  These  relations  can  be  written  as: 

^2=T^-hjUt  +  jU,  (24) 

^h  =  T^  +  3Ui  +  Ug.  (singlet) 

^3=^^  + yWf+i^*  (25) 

'*4  =  ^A  +  ^Ut  +  u,  ]  (triplet) 

By  this  analysis  it  has  been  foimd  by  Sudarshan  ^  that  the  values  of  T^  obtained 
iVom  antiparaUel  spin  orientations  are  consistent  with  those  obtained  by  other 
methods,  whereas  the  equations  form  an  inconsistent  set  for  parallel  spin 
orientations  and  hence  have  no  solution  at  all.  This  leads  to  the  conclusion  that 
antiparallel  spins  are  favoured  and  in  particular  that  the  ^He^  spin  is  zero. 
This  has  the  important  consequence  of  establishing  the  K  —  A  relative  parity 
to  be  odd  since  the  mesonic  absorption  of  K'  in  He*,  i.e.  K~  +  He*,  -> ^He*  +  tc' 
has  been  observed. 

So  far  we  have  been  considering  only  two-body  A-N  forces.  Three-body  forces 
may  also  play  an  important  role  due  to  the  longer  range.  In  the  simplest  case  the 
A  particle  in  the  hyperfragment  can  exchange  two  pions  with  two  neighbouring 
nucleons.  In  such  a  case  the  form  of  the  potential  will  be 

V{ANN) [A  di  J  -{-Bor  Oj]  (26) 

^/ll  ^A  2 

where  r^^  and  r^g  ^^^  *^^  distances  between  the  A  and  the  nucleons  1  and  2 
respectively,  and  i  and  j  the  spin  state  of  the  two  nucleons  and  hence  that  of  the 
intermediate  particle  (i.e.  i7)  at  the  two  vertices.  Since  i  and  j  can  be  interchanged 
the  spin-dependent  term  Ba^  •  Oj  in  the  potential  drops  out.  Thus  we  have 

Q-i^n^Al      e~^n^A2 

V(ANN)= a^ .  (T2  Ti  •  T2  j(27) 

^Al  ^A2 

as  the  7t-N  interaction  is  proportional  to  c;  •  t.  This  is  true  only  for  the  long  range 
part  of  the  three -body  forces.  It  has  the  simple  property  that  (i)  it  does  not  depend 
on  the  spin  of  the  A  (ii)  Cj  •  cTj  Tj  •  Tj  =  —  3  for  two  nucleons  in  a  relative  S  state. 


E.  C.  G.  SuDABSHAN  and  S.  Iwao  Phys.  Rev.  Lett.  4,  140,  (1960) 
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The  binding  energies  for  light  hypernuclei  have  been  calculated  including 
three-body  forces.  It  has  been  suggested  that  the  potential  arising  from  three- 
body  forces  is  repulsive.  This  might  possibly  explain  the  fact  that  the  potentials 
V{A  —  1)  of  hypernuclei  do  not  increase  Hnearly  with  the  mass  number  A. 

Decay  modes  of  hypernuclei 

One  of  the  methods  of  determining  the  spin  of  the  A^  is  to  observe  the  decay 
modes  of  hypernuclei.  They  are  classified  into  modes  in  which 

(i)  a  7i~  is  emitted  due  to  the  A^  decay. 

(ii)  no  7i~  is  emitted.  It  is  assumed  that  the  71"  emitted  in  the  A^  decay  is 
absorbed  by  a  nucleon  by  a  process  analogous  to  the  internal  conversion  process 
in  p  decay.  The  internal  conversion  coefficient  Q  is  defined  as  the  ratio  of  the 
number  of  nonmesonic  to  mesonic  decays. 


(a) 


Fig.  45.  Decay  modes  of  yl-nuclei 
(a)  Mesonic  decay;     (b)  Non-mesonic  decay 


The  experimental  results  show  a  smooth  transition  from  the  mesonic  to  the 
non-mesonic  decays.  The  internal  conversion  coefficient  Q  depends  strongly  on 
the  spin  of  theyl^.  Lee  and  Yang^  have  given  arguments  from  angular  momentum 
considerations  that  the  spin  of  the  A^  should  be  ^/g.  An  approximate  method 
of  determining  the  spin  from  a  rough  calculation  based  on  the  analysis  of  the 
decay  of  hyper  fragments  is  as  follows : 

In  an  electromagnetic  field,  at  large  distances  (i.e.  in  the  wave  zone),  the  poten- 

tial  has  the  form  V^^^^  ^^  - 


/  1   \    1 
while  in  the  near-zone  Fnear  ^  I  ~j7~ )  —  •  ^^^  higher 

multipole  moments  Fnear  increases  very  rapidly  near  the  origin.  Conversion  takes 
1  T.  D.  Lee  and  C.  N.  Yang,  Phys.  Rev.  109,  1755  (1958). 
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place  by  interaction  through  the  potential  Fnoar  whereas  emission  takes  place 
only  by  radiating  a  wave  asymptotically  to  great  distances.  The  internal  conversion 
coefficient  Q  is  equal  to  the  ratio  of  the  two  amplitudes.  Carrying  this  analogy 
to  the  pion  field  we  find  that  the  energy  required  for  n  ~  emission  is  less  than  that 
for  internal  conversion.  The  momentum  kf,  for  internal  conversion  is  of  the  order 
of  450  MeV/c  and  the  momentum  h^  for  emission  is  of  the  order  of  95  MeV/c.  For 

a  given  I  therefore  Q^  \~r']    x  /(/)  ^  (4  •  5)^^  where  /(Z)  is  the  normalization 

factor.  Q  also  depends  on  the  hypernuclear  structure  and  in  the  simplest  approxi- 
mation is  proportional  to  the  probability  of  finding  A  near  the  nucleon,  i.e. 
\%p^  (0)  1^.  Because  of  the  small  binding  we  can  take  the  wavelength  of  the  hyperon 
to  be  large  and  the  wave  function  for  simplicity  may  be  taken  as 

1     e-*"/^ 

(where  A  is  the  Compton  wavelength  of  A^) . 

R 
We  find  that  0  =  ~y  where  R  is  the  nuclear  radius,  and  since  we  are  integrating 

I  Wa  (0)  p  over  the  nuclear  sphere  we  obtain  -y  =  — = — rn-  with  E^a  "=  binding 

energy  of  yl.  Hence  we  have  ^oc  [EsJ'k   ^       {^'^aJ^ba)  ' 

For  ^He*,  the  comparison  of  calculated  and  experimental  values  of  Q~  (for 
which  A^  ->  p  +  n~  alone  is  considered)  shows  that  1=0  or  1,  i.e.  the  tt"  is 
emitted  as  an  aS  or  a  P  wave,  higher  moments  being  ruled  out. 

There  are  corrections  to  be  made  in  the  above  calculations  due  to  (1)  all  the 
real  mesons  produced  not  escaping  because  of  the  probability  of  internal  self- 
absorption,  and  (2)  the  A^  not  being  "free"  as  it  is  assumed  to  be.  Corrections 
due  to  the  first  eflFect  have  been  shown  to  be  unimportant.  Calculations 
of  these  corrections  have  been  done  only  in  a  qualitative  fashion  and  there  is 
no  quantitative  agreement  with  experimental  results. 

From  these  results,  Dalitz^  has  shown  that  the  following  possibility  exists. 
Since  S^  =  ^/g  and  S„=  0  the  final  state  should  correspond  to  /=  0  or  1.  The 
distinction  between  these  gives  the  relative  parities  of  the  A^  and  p.  A  mixture 
of  the  two  states  Z  =  0  and  1  is  also  allowed  because  of  the  possibility  of  the  non- 
conservation  of  parity.  Calculations  of  Karplus  and  Ruderman^  yield 

Q-_Qo+^Qi_     ^th     £  =  0.1.  (29) 

1  +  e 

^0  corresponds  to  the  case  Z  =  0  and  Qiio  1=  1 ,  the  negative  sign  referring  to  jr" 
emission .  Thus  the  S  wave  predominates  though  the  P  wave  is  not  excluded  since 

1  R.  H.  Dalitz,  Phys.  Rev.  112,  605  (1958).  See  also  the  report  by  R.  H.  Dalitz  on 
hyperon-nucleon  interactions,  International  Conference  on  High  Energy  Physics,  Kiev  (1959). 

2  R.  Karplus  and  M.  Rtjderman,  Phys.  Rev.  102,  247  (1956). 
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parity  is  not  conserved.  This  idea  can  be  checked  by  considering  specific  decay 
modes  such  as 

^H*->He4  + JT-.  (30) 

If  the  spin  of  the  hypernucleus  is  J,  then  n  has  ?  =  J  (since  the  spin  of  He*  is 
zero).  If  J  =  1  is  a  weak  channel  for  the  yl-decay  this  wiU  rarely  occur.  For  the 
other  decay  mode  ^II*->H^  +  p  +  n"  other  possibilities  exist.  Experimentally 
ten  of  the  former  have  been  observed  as  against  seven  of  the  latter  decay  modes. 
So  TF"  has  a  high  probability  of  being  emitted  in  the  dominant  channel  J  =  0 
for  ^H*.  Hence  the  singlet  state  is  a  bound  state  for  ^H*.  We  can  also  show 
that  71  is  emitted  in  the  S  state  in  the  decay  ^H^  ->  He^  -V  n~ .  Similar  reasonings 
favour  the  following  conclusions : 

(1)  The  spin  of  the  A"",  S^  =  1/2. 

(2)  A^  decays  with  mixed  parities,  the  8  state  being  predominant, 

(3)  A^  is  bound  to  the  nuclei  with  a  favoured  singlet  orientation. 
Lifetimes  of  hjrpernuclei  have  been  measured  and  these  can  be  used  to  verify 

our  assumptions  since  the  calculation  that  gives  the  conversion  coefficient  should 
also  give  the  total  (conversion  +  radiation)  lifetime. 

So  far  we  have  not  considered  hypernuclei  having  bound  hyperons  other  than  a 
A^  such  as  U^'^  or  S~.  The  27's  can  decay  rapidly  by  exchanging  pions  with 
nucleons  without  changing  their  strangeness.  Therefore  stable  nuclei  can  only  be 
formed  by  i7+  particles  with  protons  and  Z~  with  neutrons.  But  the  formation 
of  the  (i7^,  p)  system  is  difficult  because  of  Coulomb  repulsion.  The  system  (Z\  n) 
making  a  collision  in  flight  with  a  nucleus  is  believed  to  have  been  observed 
although  the  interpretation  of  the  event  is  quite  complicated.^ 

Two  events  have  been  identified  where  the  disintegration  energy  is  considerably 
greater  than  that  of  the  A^  decay.  The  energy  shows  that  the  unstable  particle 
can  be  bound  to  the  nucleus  for  times  greater  than  10 "^^  sees.  This  leads  to  two 
possibilities,  (i)  bound  charged  hyperon,  (ii)  bound  iT-particle.  In  the  first  case 
the  decay  scheme  S^  ->  A^  +  tz"  leads  to  a  disintegration  energy  of  about  380 Me V 
which  is  considerably  greater  than  that  due  to  the  decay  of  a  A^  particle.  In  the 
second  case  the  energy  available  from  the  absorption  of  a  bound  K  particle  is 
496  MeV. 

3.  RELATIVE  PARITY  BETWEEN  STRANGE  PARTICLES^ 

In  an  earlier  chapter  we  merely  summarized  the  experimental  data  available 
on  the  strong  interactions  of  strange  particles,  particularly  those  of  ^l"*",  K~  andyl^. 
We  will  now  describe  the  attempts  to  understand  these  data  from  a  theroretical 
study  of  strong  interactions.  Since  there  is  no  single  theory  of  strong  interactions 
which  adequately  explains  the  strange  particle  phenomena,  we  shall  briefly  refer 


1  R.  H.  Dalitz,  loc.  cit.,  International  Conference  on  High  Energy  Physics,  Kiev  (1959). 

2  This  section  was  written  in  a  rather  detailed  manner  at  the  suggestion  of  Dr.  T.  Kotani 
while  he  was  on  a  visit  to  Madras. 
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to  the  salient  features  of  the  various  attempts  to  understand  the  strong  inter- 
actions. 

The  first  step  in  the  study  of  any  interaction  is  to  postulate  or  derive  the 
Lagrangian  or  Hamiltonian  for  the  interaction  between  the  particles.  In  order 
to  be  able  to  write  down  the  interaction  Lagrangian  for  :;r-baryon  and  K-hskryon 
interactions  we  should  know  the  relative  parities  of  the  strange  particles  with 
respect  to  the  nucleons.  We  have  already  seen  how  through  71-d  reactions  n  is 
shown  to  be  a  pseudoscalar  with  respect  to  the  nucleon.  Since  parity  is  not  a 
conserved  quantity  in  weak  interactions  the  parity  of  the  final  system  after  a 
decay  reaction  cannot  be  equated  to  that  of  the  initial  system.  Thus  we  have  to 
deduce  the  parity  of  the  strange  particles  only  from  the  strong  and  electromagnetic 
interactions.  The  parity  of  a  strange  particle  relative  to  the  nucleon  can  be  deter- 
mined if  we  can  have  a  reaction  in  which  one  strange  particle  takes  part  in  the 
reaction  forming  part  of  the  initial  or  final  system.  However  conservation  of 
the  strangeness  quantum  number  in  the  strong  and  electromagnetic  interactions 
demands  that  more  than  one  strange  particle  be  involved.  We  recall  that  the 
associated  production  of  strange  particles  is  a  typical  example.  For  these  inter- 
actions only  the  relative  parity  of  one  strange  particle  with  respect  to  another 
has  a  meaning.  It  is  easy  to  see  that  we  need  only  the  information  on  the  rela- 
tive parities  Pxa^  Vk2:^  Pa£>  Pkk  ^^^  p^^s-  We  will  discuss  now  the  various  pro- 
posals made  to  determine  these  parities.^ 

Before  doing  so  we  shall  refer  to  a  difficulty  in  defining  uniquely  the  relative 
parities^  of  the  particles,  attention  to  which  was  originally  drawn  by  Wick, 
Wightman  and  Wigner.^  Since  the  parity  of  a  field  is  essentially  a  phase  factor 
under  P,  we  can  define  the  relative  parity  between  two  states  only  when  we  can 
measure  the  relative  phase  of  the  two  states,  i.e.  the  two  states  form  a  coherent 
system  to  which  the  quantum  mechanical  principle  of  superposition  of  states 
applies.  This  in  turn  implies  the  existence  of  an  observable  which  has  a  non- 
vanishing  matrix  element  between  them.  If  all  the  states  of  a  system  can  be  grouped 
into  different  sets  such  that  we  can  find  no  observable  having  a  non-vanishing 
matrix  element  between  any  state  drawn  from  one  set  and  any  state  drawn  from 
another  set,  a  transition  between  them  is  forbidden  completely,  and  this  is  known 
as  a  super-selection  rule.  A  typical  example  of  a  super-selection  rule  is  provided 
by  the  total  charge  operator,  for  we  cannot  conceive  of  any  observable  (e.g.  total 
energy  and  angular  momentum,  operators)  that  has  a  non-zero  matrix  element 
between  states  of  different  total  charge.  This  feature  is  a  consequence  of  charge 
conservation  in  all  the  known  interactions. 


^  See  the  review  article  "Some  Topics  Regarding  the  Strong  Interactions  of  Strange 
Particles"  by  D.  Amati  and  B.  Vitale,  Fortschr.  der  Physik  7,  375  (1959).  This  paper  also 
contains  an  extensive  bibliography  on  this  subject. 

2  The  parity  referred  to  here  is  achieved  by  means  of  a  suitable  choice  of  phase  to  provide 
for  the  invariance  of  the  strong  and  electromagnetic  interactions. 

3  G.  C.  Wick,  A.  S.  Wightman  and  E.  P.  Wigner,  Phys.  Rev.  88,  101  (1952). 
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Whenever  a  super-selection  rule  exists  it  is  impossible  to  define  the  relative  parity 
uniquely  and  the  definition  of  it  becomes  a  matter  of  convention.  To  see  this, 
we  recall  that  charge  conservation  implies  that  the  interactions  are  invariant 
under  a  phase  transformation  (i.e.  gauge  transformations  of  the  first  kind). 
Invariance  under  such  a  phase  transformation  necessarily  means  that  the  phase 
of  the  field  operators  and  consequently  the  parities  are  unobservable.  We  wish 
to  remind  ourselves  that  for  invariance  under  phase  transformations  the  inter- 
action Lagrangian  should  contain  only  terms  like  99"'' 99,  y)'^  vk  Thus  the  above 
discussion  shows  that  for  a  group  of  particles  obeying  a  gauge  transformation, 
the  relative  parity  can  be  defined  only  when  we  fix  arbitrarily  the  parity  of 
one  particle  from  the  group.  However  for  a  particle  which  does  not  belong  to 
this  group  we  can  define  a  parity  which  is  independent  of  the  convention  used 
for  the  particles  in  the  group. 

Besides  charge  conservation,  we  have  also  baryon,  lepton  and  hyper  charge 
(or  strangeness)  conservations  which  also  necessarily  imply  the  existence  of 
super-selection  rules  for  the  observables  we  normally  use.  These  conservation 
laws  can  lead  to  another  possibility  that  several  gauge  groups  may  exist  with 
common  members.  In  this  circumstance  we  can  adopt  the  following  procedure. 
If  ^  and  B  are  two  gauge  groups  of  particles  having  common  members,  we  can 
for  the  purpose  of  parity  consider  the  two  gauge  groups  A  and  B'  where  B^ 
contains  all  the  particles  in  the  original  A  and  B  after  removing  the  common 
members,  for,  as  a  convention,  we  can  adopt  an  identical  phase  for  the  common, 
members  of  both  the  groups.  Once  we  arrive  at  A  and  B'  we  can  adopt  the  usual 
procedure.  This  rearrangement  idea  can  be  easily  extended  to  any  number  of 
overlapping  gauge  groups.  The  particles  belonging  to  an  isomultiplet  have  the 
same  intrinsic  parity  for  they  are  treated  as  different  components  of  the  same  field 
and  thus  the  relative  parities  within  an  isotopic  multiplet  should  be  even. 

The  above  discussion  implies  the  following  conclusions  regarding  the  intrinsic 
parities  of  the  elementary  particles.  The  vacuum  is  assumed  to  have  even  parity. 

(1)  The  electromagnetic  field  can  be  realised  classically  and  thus  the  correspond- 
ence principle  requires  that  the  phase  of  the  electromagnetic  field  should  be  so 
chosen  that  the  classical  theory  should  be  left  invariant  under  P.  Since  ^  is  a 
polar  vector,  the  photon  has  odd  parity. 

(2)  The  intrinsic  parity  of  jt^  can  in  principle  be  determined  if  we  can  measure 
the  polarizations  of  the  two  gamma  rays  emitted  when  it  decays. ^ 

(3)  The  relative  parity  between  U^  and  A^  can  be  determined  by  means  of 
the  reaction  Z^  ->A^  +  y  (see  below). 

(4)  All  the  other  relative  parities  can  be  determined  only  by  making  further 
assumptions.  The  usual  assumption  is  that  p,  n  and  A^  have  even  relative  parity. 
As  noted  before  charge  independence  helps  us  to  fix  the  parities  of  particles 
within  an  isomultiplet.  Thus  the  77;+  and  7i~  parity  can  be  chosen  through  that 
of  the  TT^. 


1  L.  WoLFENSTEiN  and  D.  G.  Ravenhall,  Phys.  Rev.  88,  229  (1952). 
20* 
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(5)  The  electrons  and  //-mesons  have  only  electromagnetic  interactions.  Due 
to  lepton  conservation,  the  total  number  of  electrons  and  //-mesons  must  be  con- 
served and  they  must  also  be  separately  conserved  to  prevent  strong  //  ->  e 
transformations.  Thus  even  this  interaction  is  subject  to  a  superselection 
rule  which  allows  us  to  conclude  that  the  fj.-  e  relative  parity  and  the  relative 
parities  of  //  and  e  with  respect  to  other  particles  have  no  physical  significance. 
In  all  these  discussions  we  have  neglected  the  weak  interactions  the  inclusion 
of  which  will  render  the  definition  of  parity  meaningless. 

Relative  parity  p^^i  and  p^j: 

There  exist  **  in  the  air  "many  suggestions  to  determine  ^7f^  and  Pkj:  (which  will 
hereafter  be  denoted  as  p^^  y  where  Y  can  refer  to  Aot  E).  In  what  follows  the  even 
and  odd^jj^  will  correspond  to  ''scalar"  and  "pseudoscalar"  iC-mesons  respectively. 
Let  us  now  consider  these  suggestions  based  upon  (a)  production,  (b)  scattering 
and  (c)  absorption  processes. 

(a)  Production  processes.  Associated  production  by  pions  in  He*  can  be  studied 
by  the  passage  of  a  pion  beam  through  a  bubble  chamber.  We  have 

jr+  +  He*->^He4  +  Z+.  (1) 

As  the  reaction  indicates  the  produced  A^  is  absorbed  by  He^  nucleus  to  form 
the  hyperfragment  ^He*.  This  process  offers  a  very  good  indication  regarding  ^p^^i 
as  was  first  observed  by  Sakurai.  ^  Let  us  assume  that  ^He*  has  spin  0  and  even 
parity  since  the  A — N  relative  parity  is  even  by  assumption .  Since  He*  has  spin  0  and 
even  parity,  the  parity  of  the  initial  system,  i.e.  n  and  He*  is  given  by  ^  =  —  (—1)' 
where  I  represents  the  orbital  angular  momentum  and  the  total  angular  momen- 
tum equals  the  orbital  angular  momentum.  From  conservation  of  total  angular 
momentum,  it  follows  that  the  orbital  angular  momentum  of  the  final  state  is 
also  /  remembering  that  ^He*  and  K^  are  spinless.  If  pj^^  is  even,  the  parity  of 
the  final  state  will  be  given  by  ( — 1)^  in  which  case  the  reaction  cannot  take  place. 
Thus  the  observation  of  h5rperfragments  in  7r+  +  He*  reaction  clearly  indicates 
that  Pka  is  odd,  i.e.  iT-meson  is  pseudoscalar  like  the  yr-meson. 
Photoproduction  of  A'-mesons  is  represented  by  the  reactions 

'  y  +  N->K+  Y.  (2) 

The  experiments  indicate  that  the  angular  distribution  of  K^  from 
y  +  p-^A^  +  K^  does  not  deviate  significantly  from  isotropy  though  there  is 
a  rough  indication  of  a  small  peak  in  the  region  around  90°.  The  cross-sections 
for  y  +  iV  ->  2"  +  ^  and  y  +  N-^A  +  Kdo  not  differ  very  much  at  comparable 
energies.  We  know  very  little  about  the  photoproduction  of  K^.  Except  for  the 
lowest  order  perturbation  calculations^  no  theoretical  study  of  the  problem  has 


1  J.  Sakurai,  Phys.  Rev.  107,  1119  (1957). 

2  A.  Fujn  and  R.  E.  Marshak,  Phys.  Rev.  107,  570  (1957). 
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been  made  so  far.  However  if  we  assume  only  8  and  P  waves  exist  in  the  final 
state,  we  can  reduce  the  number  of  multipoles  that  contribute  to  the  reaction.^ 
We  give  below  in  a  table  a  list  of  all  the  allowed  initial  and  final  states. 


Multipole 

' 

i'irreven 

P£r  odd 

E, 

1/2 

Pll2 

Sii2 

E, 

3/2 

PSI2 

- 

M, 

1/2 

^1/2 

Pl/2 

M, 

3/2 

- 

P3/2 

Here  E^  and  M^  refer  to  the  electric  and  magnetic  dipole  transitions.  As  in  the 
case  of  the  photoproduction  of  tt's,  the  neutral  K^  can  only  be  produced  directly 
by  the  nucleons  ("shaking  off"  transitions)  and  hence  they  cannot  change  the 
orbital  angular  momentum  by  direct  interaction  with  the  electromagnetic  field. 
Thus  for  even  p^  y  ?  ^^  will  be  produced  in  an  /S  state  while  for  odd  PxY^  ^^  will 
be  in  a  P-state.  This  feature  impUes  that  near  the  threshold  the  total  cross-section 
will  vary  as  pjg  and  pi  in  the  two  cases  respectively  where  pj^  is  the  momentum 
of  the  K-meson  in  the  laboratory  system  while  the  differential  cross-section 
will  be  isotropic  for  even  pxA  and  will  have  the  form  A  +  B  cos^^  for  odd  pj^^i . 
At  present  very  little  experimental  work  has  been  done  on  photo -^^  production. 
Further  it  has  been  shown  that  if  we  take  into  account  the  anomalous  magnetic 
moments  of  the  fermions  involved  in  the  reaction,  the  distinction  between  the 
features  of  the  angular  distribution  of  scalar  and  pseudoscalar  photoproduction 
of  K  becomes  less  pronounced.^  Recently  Moravcisk^  has  adopted  the  extra- 
polation procedure  (see  below  the  discussion  on  p^^^  P^i'ity)  used  to  determine 
the  pion-nucleon  coupling  constant  from  pion  photoproduction*  to  the  photo 
production  of  K  in  order  to  determine  the  parity  of  the  K-m.eson.  This  suggestion 
depends  on  the  fact  that  the  sign  of  the  residue  is  positive  for  pseudoscalar 
iT-mesons  and  negative  for  scalar  iT-mesons.  The  analysis  of  the  K'^-A^ 
photoproduction  between  100-1000  MeV  seems  to  favour  the  pseudoscalar 
iiT"'' -meson. 


Associated   production    in   71-  N    coUisions    can   yield 
formation  on  pj^^  and  Pkz-  Consider  the  process 


us 


mdependent 


TT" 


+>^ylo+  Zo. 


m- 


(3) 


At  the  threshold  the  A^  +  K^  will  be  produced  in  the  8  state  and  the  P  state 
respectively  for  the  two  cases  of  even  and  odd  Pka-  I^  is  clear  that  only  Pi    and 


1  D.  Amati  and  B.  Vitale,  Nuovo  Cim.  6,  395  (1957). 

2  M.  Kawaguchi  and  M.  J.  Moravcisk,  Phys.  Rev.  107,  563  (1957). 
'•^  M.  J.  Moravcisk,  Phys.  Rev.  Lett.  2,  171  (1959). 

*  J.  G.  Taylor,  M.  J.  Moravcisk  and  J.  L.  Uretsky,  Phys.  Rev.  113,  689  (1959). 
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S^  waves  of  the  initial  system  will  be  affected  by  the  production  in  the  two  cases 
respectively.  Thus  we  can  expect  the  7i~-p  scattering  to  exhibit  an  anomalous 
behaviour  (i.e.  cusp  phenomena)  at  the  threshold  for  A^  production.  This  effect 
was  estimated  by  Adair  ^  to  be  10  per  cent  of  the  ji-p  differential  cross-section. 
We  will  discuss  presently  the  anomalous  behaviour  in  a  process  near  the  threshold 
of  another  process  in  detail  when  we  study  the  suggestions  made  to  determine 
iT-A"  relative  parity.  Experiments  can  also  help  in  deciding  whether  the  S^ 
or  P^    wave  is  affected.  However  we  know  that  the  7z~-p  cross-section  has  a 


K*    ,-- 


Fig.  46.  Associated  production  in  Ji  -p  collision  involving  direct  K-K-n  interaction. 


rather  broad  maximum  near  900  MeV  and  thus  there  seems  to  be  no  anomalous 
effect. 

Another  possibility  of  using  this  production  process  has  been  pointed  out  and 
is  based  on  the  fact  that  the  angular  distribution  of  the  produced  hyperon  depends 
on  the  sign  of  Pky^-  To  realise  this,  we  note  that  the  transition  matrix  in  the 
cm.  system  will  have  the  form 


T^A+Ba-ip^x  Pk)  Vky  =  odd 

=  C(Jp„  +  Da'P^  Pky  =  even 


(4) 


where  a  is  the  baryon  spin  operator  and  p„  and  p^  are  the  momenta  of  the  tz  and  K 
mesons  respectively  in  the  cm.  system.  The  expressions  are  derived  using  con- 
siderations based  on  parity  conservation.  Remembering  that  tz  is  pseudoscalar, 
T  must  be  scalar  for  pjr  y  =  odd  and  pseudoscalar  for  Pky  =  even  since  T  connects 


1  R.  Adair,  Phys.  Rev.  Ill,  632  (1958). 

2  D.  Amati  andB.  Vitale,  Nuovo  Cim.  9,  340  (1958);  R.  H.  Capps,  Phys.  Rev.  115,  736 
(1959). 
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the  TZ-p  and  K-A  states.  A,  B,C  andZ>  are  scalar  coefficients  and  are  complex 
functions  of  p„  •  pj^  and  the  energy  co.  We  can  now  adopt  Watson's  ^  theorem 
which  expresses  the  photoproduction  amplitude  in  terms  of  the  pion-nucleon 
phase  shifts  provided  we  can  treat  the  K-Y  interaction  as  weaker  than  the  jr-iV^ 
interaction,  i.e.  we  can  neglect  the  final  state  interaction.  However  in  the  energy 
region  where  A^  production  becomes  possible,  the  phase  shift  analysis  becomes 
difficult  since  the  phase  shifts  themselves  become  complex  due  to  the  opening 
of  inelastic  channels  and  also  because  of  the  increase  in  the  number  of  partial 
waves  of  angular  momentum  making  significant  contributions  to  the  reaction. 
Using  the  density  matrix  formalism  we  can  express  the  initial  polarization  in  a 
direction  n,  if  Qi  is  the  density  matrix  for  the  initial  state,  as 

Qi(n)  =  Spla-HQi],  (5) 

where  >S'^[^]  indicates  the  spur  of  the  matrix  [A].  Also  we  have  normalized  the 
initial  state  such  that  Splg^]  =  1.  The  density  matrix  for  the  final  state  Qf  is 
given  as  a  function  of  the  energy  co  and  the  production  zenith  and  azimuth  angles 
6  and  w : 

Qf{co,d,p)^T{co,d,cp)QiTH(o,6,(p).  (6) 

The  differential  cross-section  and  final  polarization  in  a  direction  tn  are  given  by 
■j^(co,  d,  (p)  =  Sp[Qf{co,  6,  (p)]  (7a) 

Using  these  formulae,  we  can  obtain  the  differential  cross-section  and  final 
polarization  of  A^  whose  measurement  can  serve  to  distinguish  between  the  two 
cases  of  scalar  and  pseudoscalar  K.  In  passing  we  note  that  the  A^  can  be  polarized 
only  in  a  direction  normal  to  the  production  plane  defined  by  p„  and  pj^  due  to 
parity  conservation.  Capps^  has  noted  an  interesting  situation  namely  that  if 
the  nucleon  polarization  direction  n  is  perpendicular  to  the  incident  tt  beam, 
then  the  transverse  spin-spin  correlation  can  serve  to  determine  Pka-  More  pre- 
cisely we  have  in  the  forward  (or  backward)  direction 

Pf{n)=      p^{n)  Pka  odd 

Pf{n)^-Pi{n)  2?^^  even. 

The  A  polarization  direction  may  be  obtained  from  the  asymmetry  in  the  A 
decay,  the  sign  of  the  asymmetry  being  known  from  the  experiment  of  Boldt 
et  a/.  ^Another  method  of  deciding  ^^^  is  to  perform  experiments  with  polarized 


1  K.  M.  Watson,  Phys.  Rev.  95,  228  (1954). 

2  R.  H.  Capps,  loc.  cit.  See  also,  A.  P.  Balachandran  and  N.  R.  Ranganathan,  Nuovo 
Cim.  16,  1142  (1960). 

3  Boldt  et  al,  Phys.  Rev.  Lett.  1,  256  (1958). 
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and  unpolarized  targets.  It  can  be  shown  that  the  polarization  of  A^  when  the 
target  is  unpolarized  is  given  by  the  left-right  asjmametry  in  the  differential 
cross-section  for  A^  production  resulting  from  a  polarized  target,  but  the  sign 
of  the  polarization  depends  on  Pka-^  The  result  is  given  by 

-^(Q.^)--^(Q.<p)V^±fdO,<P)Vo(Q^9>)\>  (9) 

d(7o 
where  -t-P\  and  'p^  denote   the  differential  cross-section  and  final  polarization 

do" 
using  an  unpolarized  target  while  -rjy  and  p^  refer  to  the  differential  cross-section 

obtained  when  the  target  has  polarization  ^iven  by  pi .  The  plus  and  minus  signs 
refer  to  odd  and  even  fj^^  cases  respectively. 

Associated  production  in  the  N-N  collisions  can  also  be  analysed  using  parity 
and  angular  momentum  conservation.  As  before  for  the  two  p^y^^  cases,  we  will 
have  different  orbital  angular  momenta  for  the  final  states.  Thus  studying  the 
momentum  dependence  of  the  total  cross-section  and  the  angular  distribution^ 
we  can  hope  to  distinguish  between  the  two  pj^^  cases.  The  present  data  for  this 
process  are  so  meagre  that  we  will  not  consider  this  any  further.^ 

(b)  Scattering  experiments.  We  shall  now  study  the  K-N  and  K-N  scatterings 
with  a  view  to  deciding  Pka-  We  shall  envisage  the  scattering  to  occur  through 
the  strangeness- conserving  Yukawa  interactions  between  iT-mesons  and  other 
baryons.  Later  when  we  discuss  the  symmetries  of  the  strong  interactions  we 
shall  write  them  down  explicitly.  We  shall  merely  indicate  what  can  happen 
at  a  vertex,  i.e. 

N^A  +  K',    N<~>Z+K;    A<->S  +  K;    E<->S+K.  (10) 

In  order  to  see  the  influence  of  Pka^  we  observe  that  for  a  scalar  K,  the 
final  A  +  K  will  be  in  an  >Sf  state  at  the  threshold.  In  a  similar  way,  for  a 
pseudoscalar  K,  it  will  be  in  a  P  state.  Thus  the  corresponding  interaction 
Lagrangian  will  be  different  in  the  two  cases.  The  first  will  not  involve  a  y^  while 
the  second  will.  A  simplified  model  treating  the  nucleon  as  a  fixed  heavy  source 
and  also  neglecting  the  contribution  from  the  virtual  K  and  tf  from  the  inter- 
mediate states  for  the  K-N  scattering  has  been  studied  using  the  Tamm-Dancoff 
approximation.^  Neglect  of  the  nucleon  recoil  is  certainly  unjustifiable  in  the 
K-N  scattering  unlike  the  jt-N  scattering.  The  study  throws  practically  no  light 

onpKY- 

The  dispersion  relations  technique  has  been  used  for  the  determination  oi  pxA 
and  Pj^^  and  the  corresponding  coupling  constants  g^^  and  gxz-  Disregarding 
for  the  moment  the  difficulty  in  proving  the  dispersion  relations  for  K^-N 
scattering  because  of  the  possible  breakdown  of  the  necessary  relation  between 

1  S.  M.  BiLENKH  and  R.  M.  Ryndin,  J.  Expt.  Theoret.  Phys.  35,  827  (1958). 

2  G.  Costa  and  B.  T.  Feld,  Phys.  Rev.  109,  606  (1958). 

3  D.  Amati  and  B.  Vitale,  NtiovoCim.  5,  1533;  6,  261  (1957).  C.  Ceolin  and  N.  Taffara, 
Nuovo  Cim.  6,  425  (1957). 
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K  and  A  masses,  the  dispersion  relations  for  the  iT-proton  forward  scattering 
can  be  written  down  as  follows.^ 


C0^  +  CO 


m^-\-  0) 


+ 


+ 


rriK 
J^     r    dco'A_{M' 

71     J 


G^{(o')    ^    a_{a)') 
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71     J 
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In  this  expression!)^  denote  the  real  parts  of  the  K^-p  forward  scattering  ampli- 
tudes respectively  and  ^_  denotes  the  imaginary  part  of  iT^-^  forward  scatter- 


ing amplitude  and  coi 


'^N 


2mN 


mi  being  the  rest  mass  of  the  system  i 


{i  =  AoxE).  The  first  two  terms  on  the  right  hand  side  are  the  poles  arising  from  the 
possible  intermediate  states,  viz.  A  and  27 particles  in  the  iT-^  scattering  alone.  The 
interesting  point  is  that  the  residues  at  these  poles  depend  on  the  parities  Pka 
and  Pj^^  and  involve  the  corresponding  coupling  constants.  The  expressions  for 
the  residues  are 


Vka=  +1; 


-1 


X{A) 


X{A)  = 


Qka 


4:71 

9ka 

471 


(m^ 


m«r  -  mi 


4m^w^ 


4:mj,m^ 


(13a) 


(13b) 


Similar  expressions  hold  for  Z  also  with  H  replacing  A.  Since  in  the  energy  region 
from  co^7r«  co27ji)  to  m^,  though  unphysical  for  K^-p  scattering  we  can  have 
A-7t  or  Z-71  states  of  positive  kinetic  energy,  we  have  to  integrate  A_  over 
this  region.  For  K^-n  scattering,  the  dispersion  relations  have  the  same  form 
except  that  Z  (yl)  =  0  since  K--n  is  in  a  pure  T  =  1  state.  Thus  only  the  E  state 
can  contribute  a  pole  and  the  K^-n  dispersion  is  related  to  Pke-  Also  X{E) 
is  twice  the  expression  given  above. 


1  D.  Amati  and  B.  Vitale,  Nuovo  Cim.  7,  190  (1958). 
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The  behaviour  of  ^_  in  the  unphysical  region  can  be  quite  complicated.  A_ 
becomes  a  finite  quantity  as  co  ->  nix  since  for  an  endothermic  reaction  the  cross 

section  behaves  like  -j-  near  the  threshold.  We  can  also  assert  that  A    has  to 
k 

vanish  for  co  <  o)^„.  Though  in  principle  an  analytic  continuation  of  ^_  from  th(^ 

physical  region  to  the  unphysical  region  should  throw  light  on  the  behaviour  of  ^_ 

the  scattering  amplitudes  can  have  cusps  in  the  imphysical  region  at  the  threshold 

for  reactions  like  Hn  and  A  -{-  n  -\-  ti'^.  It  was  also  pointed  out  by  Salam  and 

Matthews  who  have  analysed  the  behaviour  of  .4_  by  means  of  perturbation  theory 

that  A_  can  go  through  zero  and  take  negative  values  in  the  unphysical  region.^ 

The  convergence  of  the  integrals  over  the  cross-sections  in  (11)  is  not  possible 

if  both  a+  and  a_  approach  constant  values  at  infinite  energy.   GoebeP  and 

Matthews  and  Salam*  have  suggested  the  use  of  the  dispersion  relations  obtained 

by  subtracting  (12)  from  (11).  Assuming  pj^^:  to  be  even,  we  have  at  ca  =  m^ 

00 

mAD, {m^)  -  D_{mi,)-\  -  ^- j  ^  [a, (co')  -  a_ {(o')] 

rriE 

^2S?hL±^kA     for  scalar  ^-meson  (14b) 

4:71 


[2mp\   [ 


9ka  +  gKH 

4:71 


for  pseudoscalar^L -mesons.      (14c) 


It  is  probable  that  the  integral  over  the  cross-section  in  (14  a)  is  convergent. 
In  the  energy  region  where  we  have  experimental  data,  a_  is  always  larger  than  a+ . 
Since  K'  -f  p  has  so  many  channels  compared  to  K+  +  ^  we  can  expect  that 
even  at  higher  energies  a_  >  c^ .  The  integral  over  the  cross-section  is  therefore 
positive.  Further  D+{mj^)  is  known  to  be  negative  and  small  while  D_{mj^)  is 
quite  large,  li  K'-p  interaction  is  repulsive  theni)_(w^)  is  negative.  We  can 
thus  conclude,  neglecting  the  small  contribution  from  the  unphysical  region, 
that  the  iT-meson  is  a  scalar,  for  the  whole  quantity  in  the  left-hand  side  is  positive. 
However  if  K-p  interaction  is  attractive  as  it  seems  to  be,  then  D_(mg) 
is  positive  and  the  quantity  in  the  left-hand  side  depends  upon  the  relative 
magnitude  of  D_  {rrig)  and  Z)+  {m^  and  the  numerical  value  of  the  integral.  But 
it  seems  that  this  is  more  likely  to  be  negative  indicating  that  iT-meson  is  probably 

1  K.  Igi,  Progr.  Theoret.  Phys.  19,  238  (1958). 

2  P.T.Matthews  and  A.  Salam,  Phys.  Rev.  110,  565,  569  (1958);  R.  H.  Dalitz  and 
S.  F.  TuAN,  Ann.  Phys.  8,  100  (1959). 

3  C.  GoEBEL,  Phys.  Rev.  100,  572  (1958). 

*  P.  T.  Matthews  and  A.  Salam,  loc.  cit.  (1958). 
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pseudoscalar.  Igi^  has  suggested  a  different  type  of  dispersion  relation  where  the 
integrals  converge  rapidly  and  which  is  in  some  measure  weighted  against 
contributions  from  the  unphysical  region.  The  dispersion  relation  has  the  form 

CO  [i)+  (co)  —  J  (mjf  +  (o)  Z)+  (m^-)  —  J  (m^  —  a))  D_  (m^)] 


THk 


1     r  dco  \    G_^  ^-1     1    r  do 

4:7ti  J      k'    [o}'  —  (JO        ft>'  +  ft)  J       71   J      k' 

-'Pky[X{A)  +  X{E)-] 


1     r  dft)'  A_{(d') 


co'  +  ft) 


(15) 


The  contribution  from  the  unphysical  region  is  estimated  by  taking  a  smooth 
extrapolation  of  ^_  for  co  <  m^.  Assuming  a^  to  be  constant  it  is  found  that  if 

D_  is  positive,  K  is  pseudoscalar  with ~  4,  while  if  D_  is  negative, 

K  is  scalar  with  [g^A  +  9k2:)I^^  ^^  0.08.  Assuming  that  (T+  falls  off  by  a  factor 
of  two  between  50  MeV  and  zero  energy,  the  iiT -meson  seems  to  be  scalar  irrespective 

of  the  sign  of  D_  for ~  4.  Recently  the  dispersion  relations  have  been 

4  TT 

used  to  determine  the  sign  and  magnitude  of  D_  (m^)  which  is  sensitive  to  pjg^y. 
Also  the  low  energy  behaviour  o^^  has  been  used  to  decide  pj^  y  ^1  these  arguments 
favour  only  a  pseudoscalar  iT-meson  assuming  p^E  to  be  even. 

Perhaps  with  more  accurate  data  on  K^-  p  scattering  cross-teections  at  energies 
below  50  MeV  and  also  from  their  angular  distributions ,  we  may  be  able  to  decide  p^  ^  . 
Further  if  we  have  more  data  from  K^-n  scattering,  we  can  form  an  independent 
conclusion  on  pj^^:  which  can  be  also  utilized  to  discuss  K^-p  dispersion  relations. 

(c)  Absorption  processes.  Several  proposals  to  decide  pj^  y  froni  K~  absorption 
in  hydrogen,  deuterium  and  hehum  have  been  put  forward.  It  is  obviously  very 
important  in  studying  these  processes  to  have  some  idea  of  the  atomic  orbit 
from  which  iT^-mesons  are  captured.  Recently,  Day,  Snow  and  Sucher^ 
have  drawn  attention  to  the  influence  of  Stark  effect  collisions  of  a  highly  excited 
K-  —  p  atom  with  the  neighbouring  protons  in  the  liquid  hydrogen.  They  thus 
obtain  the  very  important  result  that  the  capture  at  rest  of  K~  always  takes  from 
8  orbits,  i.e.  the  relative  orbital  angular  momentum  of  the  K~-p  system  is 
always  zero.^  Similar  considerations  can  be  expected  to  hold  for  captures  at  rest 
of  K-  in  d.    , 


1  K.  Igi,  Progr.  Theoret.  Phys.  20,  403  (1958). 

2  T.  B.  Day,  G.  A.  Snow  and  J.  Sucher,  Phys.  Rev.  Lett.  3,  61  (1959). 

^  From  the  capture  at  rest  of  X^  in  protons  we  can  determine  the  spin  of  the  hyperons. 
Making  use  of  the  fact  that  capture  is  from  8  orbit,  the  arguments  of  Adair  given  for  deter- 
mination of  hyperon  spin  in  Ji'-P  collisions  can  be  repeated.  Thus  if  there  is  no  anisotropy 
in  the  decay  products  of  the  hyperon,  the  spin  of  the  hyperon  is  J  (see  S.  B.  Treiman,  Phys. 
Rev.  101,  1217  (1956)).  Recent  experiments  by  Leitner  et  at.  (see  Phys.  Rev.  Lett.  3,  238 
(1959))  indicate  that  the  decay  distribution  does  not  statistically  deviate  from  isotropy^ 
and  hence  they  conclude  that  spin  of  i7+  is  ^/g.  They  also  find  that  there  is  no  violation  of 
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Let  US  consider  the  as  yet  unobserved  reactions 

K-  f  p  ->  yl®  -f  7r+  +  71-^  (16a) 

~ylo+  71°  +  ttO.  {16  b) 

Okun  and  Pomeranchuk^  have  estimated  from  phase  space  considerations* 
that  these  should  constitute  2  per  cent  of  all  the  capture  reactions  for  scalar 
iT-mesons.  If  K  were  pseudoscalar  this  reaction  is  damped  due  to  centrifugal 
barrier  effects  for  P  wave  motions  necessary  in  the  final  state  to  conserve  parity 
and  angular  momentum.  In  this  case  A^  from  (16  b),  can  be  polarized  and  due  to 
the  symmetry  of  identical  boson  wave  functions  the  branching  ratio  will  be 
affected.  Similar  results  obtain  for  absorption  in  flights  for  kinetic  energies  up  to 
30  MeV  of  K~  in  the  labotarory.^  We  can  also  note  that  the  reaction 

Z-  +  7i^i:+  +  n-  +  71-,  (17) 

whicli  can  take  place  for  K~  with  the  kinetic  energy  '^80  MeV  in  the  laboratory 
system  will  be  permitted  only  if  f^j:  is  even.  Odd  p^^:  requires  P  wave  motions 
in  the  final  state  which  is  not  possible  for  the  271'.^ 

If  (Z~n)  can  exist  as  a  bound  system,  Pais  and  Treiman*  have  shown  that  for 

{E-n)7t^ 
the  capture  oi  K~  from  an  S  orbit  in  deuterium  the  branching  ratio -jc^;^ + 

for  a  binding  energy  of  1  MeV  or  more  is  possible.  Since  we  know  from  argu- 
ments of  Day  etal.^  that  the  capture  at  rest  oiK'  takes  place  only  from  >S  orbits,. 
the  absence  of  the  reaction 

K-  +  d->(E-n)^-  7t^  (18) 

can  be  only  due  to  the  {E~n)  bound  state  having  zero  spin  and  f^j:  being  odd. 
Consider  the  reaction® 

K+d^Y+N.  (19) 

The  fact  that  iC"  is  captured  at  rest  only  from  8  orbits  helps  us  to  write  down 
the  transition  matrix  for  both  scalar  and  pseudoscalar  K  cases.  Using  arguments 
given  earlier  in  Ti'-p  reactions,  it  has  been  noticed  that  the  sign  of  the  cor- 
relation between  the  polarization  of  Y  and  N  depends  on  pj^y  parity  as  we  noted 
before.  However  this  process  occurs  in  only  about  1  per  cent  of  all  the  Jl~  capture 


parity  in  strong  interactions.  It  is  to  be  noted  that  any  component  of  the  polarization  P^:  in. 
the  production  plane  is  an  evidence  for  parity  non-conservation  in  the  strong  interaction. 
Crawford  et  al.  (see  Phys.  Rev.  Lett.  1,  418  (1958))  have  arrived  at  a  similar  conclusion,  i.e. 
parity  is  not  violated  in  strong  interactions,  by  an  analysis  of  the  decay  products  of  yi**  produced 
'uv7i~  +p^A^  +  K^  by  making  various  choices  of  coordinate  systems. 

1  L.  B.  Okun  and  I.  Ya.  Pomeranchuk,  J.  Expt.  Theoret.  Phys.,  34,  997  (1958). 

2  L.  Fonda  and  J.  E.  Russel,  Phys.  Rev.  Lett.  2,  57  (1959). 
^  D.  Amati  and  B.  Vitale.  Fortsch.  Phys.,  loc.  cit. 

*  A.  Pais  and  S.  B.  Treiman,  Phys.  Rev.  106,  1106  (1957). 

*  T.  B.  Day  et  al.,  loc.  cit.  (1959). 

«  A.  SiRLiN  and  R.  Spitzer,  Phys.  Rev.  Lett.  3,  110  (1959). 
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reactions  and  the  determination  of  polarization  is  not  easy  for  n  andi7  ,  though 
for  A^  its  polarization  can  be  estimated  from  the  decay  asymmetry. 

Finally  we  refer  to  an  interesting  proposal  due  to  Dalitz  and  Downs  ^  who 
suggested  the  possibility  of  determining  j)^^  from  K-  absorption  in  He*.  We  have 

K-  +  He*  ->  ^He*  +  n.  (20) 

Assuming  that  ^iHe*  has  zero  spin  and  even  parity,  the  reaction  is  forbidden 
for  even  ^^^  since  tt"  is  pseudoscalar.  For  odd  p^^,  the  reaction  can  take  place. 
Dalitz  2  has  also  given  arguments  against  the  existence  of  an  excited  state  of  ^He* 
with  /  =  1.  There  is  also  a  high  yield  of  hyperfragments  in  the  K~  capture  by 
He*.  Thus  we  can  conclude  that  pj^^  is  odd.  Recently  Pevsner  et  al.^  have  observed 
^He*. 

Summing  up,  we  find  that  the  dispersion  relation  approach  seems  to  indicate 
that  pseudoscalar  iT-meson  with  respect  to  both  the  hyperons  is  more  likely. 
However  we  need  to  study  the  dispersion  relations  for  K-N  scattering  when  the 
I^-A  system  has  odd  relative  parity.  If  the  various  proposals  also  confirm  that 
^L-meson  is  pseudoscalar,  we  have  the  interesting  fact  that  both  the  bosons  n 
and  K  are  pseudoscalar. 

Reldtive  parity  p^x-  -^^  pointed  out  before,  the  relative  parity  pxK  of  K^  —  K^ 
has  to  be  even  as  long  as  K^-K^  forms  an  isodoublet.  However,  as  we  sliall 
see  later,  it  is  possible  to  describe  the  K^  and  K^  as  two  different  isosinglets 
(see  Pais*  "Model  of  Strong  Interactions")  in  which  case  pj^^  can  be  defined 
and  tests  for  measuring  Pkk  ^^^  necessary.  Pais  has  studied  the  effect  of  odd  Pkk 
on  charge  independence  in  strong  interactions  and  the  production  and  scattering 
of  baryons  and  iT-mesons.  In  the  nucleon-anti-nucleon  annihilation  into  the  K 
pair,  i.e. 

n  +  p->K^  -\-  K''  (21) 

i}he  sign  of  pj^j^  will  determine  the  initial  angular  momentum  states. 

Recently  a  new  method  has  become  possible  to  obtain  the  relative  parities 
and  coupling  constants  for  the  strange  particles  using  general  field  theoretic 
concepts.  These  methods  are  essentially  a  consequence  of  the  Mandelstam  re- 
presentation^ of  dispersion  relations  from  which  it  is  possible  to  show  that  the 
existence  of  dispersion  relations  in  the  momentum  transfer  is  just  like  that  in 
the  total  energy.  Chew®  first  made  use  of  the  analyticity  properties  of  the 
scattering  and  production  amplitudes  in  the  momentum  transfer  variable  to 
determine  the  pion-nucleon  coupling  constant  from  nucleon-nucleon  scattering 


1  R.  Dalitz  and  B.  Downs,  Phys.  Rev.  Ill,  967  (1958). 

2  Hyperon-Nucleon  Interactions,  Dalitz,  Kiev  (1959). 

3  Pevsner  et  al.,  Phys.  Rev.  Lett.  3,  291  (1959).  See  also,  Block  et  al.  Proc.  of  the  Ann. 
Int.  Conf.  on  High  energy  Physics  at  Rochester  {I960).,  p.  419. 

4  A.  Pais,  Phys.  Rev.  110,  574  (1958);  112,  624  (1958);  Phys.  Rev.  Lett.  1,  418  (1958). 

5  S.  Mandlestam,  Phys.  Rev.  112,  1344  (1958). 

6  G.  F.  Chew,  Phys.  Rev.  112,  1380  (1958). 
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(see  chapt<^r  V).  We  shall  discuss  here  how  the  method  can  be  applied  to  deter- 
mine the  sign  of  Pkk-^ 

Let  us  consider  the  matrix  element  M  for  the  associated  production  process 
7i~  -\-  p-^A^  +  K^.  M  will  be  a  function  of  the  two  invariants,  the  total  energy 
10  =  (pp  +  p„Y  ^^^  t^^  momentum  transfer  Zl*  =  {pp  —  p^y^.  If  we  study  the 
analytic  behaviour  of  3/(w,  J 2)  as  a  function  of  zl^,  we  should  expect  a  pole  arising 
from  an  intermediate  stat-e  composed  of  a  single  JiT-meson  as  shown  in  Fig.  46. 


Fig.  47.  A^  and  27"  production  cross-section. 


This  figure  essentially  involves  a  iC"  —  K-ji'  interaction  which  is  possible  as  a 
strong  interaction  only  ii  Pkk  were  odd.  Thus  the  residue  at  this  pole  should  be 
zero  ii Pkk  were  even.  With  this  remark,  we  just  state  that  we  can  write  M(a),  A^) 


oc  —  cos  u 


(22) 


The  first  term  on  the  right-hand  side  is  only  the  renormalized  Born  term  corre- 
sponding to  the  figure  so  that  the  function  is  known.  /  is  a  regular  function  and 
has  no  singularity  at  cos  Q  ^  oc  .ck'i^  given  by 


2  y;t«  Pro  -  ^j 


(23) 


and  6  is  the  angle  between  p„  and  p^  in  the  centre  of  mass  system.  Since  ^  is  >  1, 
the  pole  at  cos0  =  oc  will  lie  outside  the  physical  range  of  cos 6,  viz.  — 1  ^  cos^ 


1  J.  G.  Taylor,    Nitcl.    Phys.    9,    357    (1959);  M.  J.  Moravicisk,    J.G.Taylor   and 
J.  L.  Uretsky,  Phys.  Rev.  113,  689  (1959). 
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^  1  .^  For  high  n  energies,  (x  is  close  to  unity.  For  pion  laboratory  momenta  from 
1.3  to  1.4  BeV/c,  the  range  of  the  values  of  <%  is  between  1.6  and  1.4.  The  differential 
cross- section  is  proportional  to  |M  p  and  so  we  can  write 

dG{(jo,  d)  \g{oy,  co^d\^       g{co,  cos 6)  f*{co,  cos 6) 

dQ  {(x  —  cos6)^  oc  —  cosd  /24\ 

+  \f{co,  cos6))|2 

where  K{cjo)  is  a  known  function  of  the  energy.  It  is  convenient  to  study  the 
singularity  of  the  function 

F  {w ,  cos  6>)  =  (^  -  cos  Of  ^^^^^  ■  (25) 

We  see  that 

F{co,oc)  =  K(co)  \g{co,oc)\^  (26) 

Thus  we  have  been  able  to  separate  the  term  which  can  be  calculated.  In  order 
to  decide  Pkk^  we  extrapolate  F{w,  cosO)  from  the  physical  region  to  the  un- 
physical  region  up  to  cos  6  =  oc  and  see  if  a  non-zero  residue  exists  which  will 
definitely  show  that  ^j^^  is  odd.  Further  if  F{co,oc)  is  >0,  then  ^^^  is  even  while 
if  F{(o,oc)  <  0,  Pka  is  odd  (see  the  previous  discussion  on  K'^  —  p  scattering  using 
dispersion  relations).  It  is  also  possible  to  estimate  the  coupling  constants  from 
the  magnitude  of  the  residue.  The  main  difficulty  with  this  method  lies  in  the 
extrapolation  procedure.  Recently  it  has  been  shown  that  this  method  of  extra- 
polation may  lead  to  fallacious  results  like  ji^  being  a  scalar  or  the  possibility  of 
non- conservation  of  parity  in  strong  interactions.^  If  F{a),  cos  6)  is  linear  in  cos0, 
we  should  not  conclude  that  this  linearity  is  caused  mainly  by  the  pole  term. 
We  should  be  very  cautious  in  assuming  shapes  for  F{oj,  cos0)  for  the  extra- 
polation procedure.  Possible  applications  of  this  procedure  to  hyperon-nucleon 
scattering  in  order  to  determine  hyperon  parities  and  coupling  strengths  have 
been  discussed  to  Bar  shay  and  Glashow.^  Recently  Taylor*  has  discussed  in  detail 
what  can  be  gleaned  by  this  method  from  the  processes  of  associated  pro- 
duction by  photons  on  nucleons,  by  pions  on  protons  and  from  the  absorption 
and  charge -exchange  scattering  of  K^  on  nucleons.  He  finds  that  if  p^K  is  odd, 
then  strange  particle  parities  may  be  obtained  from  all  these  processes  while 
if  p^j^  is  even,  only  the  photoproduction  process  may  be  used  to  determine 
the  parities. 

Relative  parity  Pas-  The  role  of  ^^^  in  K^-N  scattering  has  already  been 
discussed  where  A  ot  U  are  present  only  in  the  intermediate  states.  We  shall 
describe  a  few  methods  for  determining  directly  /)^^ .  A  very  interesting  suggestion 


^  There  is  another  pole  arising  from  an  intermediate  hyperon  state  which  can  be  made 
use  of  only  at  higher  ti"  energies. 

2  G.  Feldman  and  T.  Fulton,  Phys.  Rev.  Lett.  3,  64  (1959). 

3  S.  Barshay  and  S.  L.  Glashow,  Phys.  Rev.  Lett.  2,  371  (1959). 

4  J.  G.  Taylor,  PMjs.  Rev.  116,  768  (1959). 
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is  due  to  Baz  and  Okun^  who  made  use  of  a  theorem  in  nuclear  physics  due 
to  Breit  and  Wigner  regarding  the  behaviour  of  cross-sections  for  a  reaction 
near  the  threshold  of  another  reaction.  As  the  method  is  quite  generally  applicable 
we  shall  discuss  it  in  some  detail.  As  an  example,  let  us  consider  the  reaction 

71-  -{-  p-^A^+  K^  (27) 

near  the  threshold  ^  890  MeV  in  the  laboratory  system  for  the  reaction 

n-  +  p^2y-{-  K^.  (28) 

At  this  energy  range  the  total  cross-section  and  angular  distribution  for  (27) 
will  exhibit  a  cusp-like  behaviour,  the  study  of  which  can  reveal  Paz-  Near  the 
threshold  it  is  quite  justifiable  to  assume  that  {I^^K^)  is  produced  in  an  /S  state. 
Due  to  angular  momentum  conservation  it  is  clear  that  only  the  8  and  P  states 
of  A^K^  will  be  affected  by  the  Z  production  for  even  or  odd  ^^2:  respectively. 
Also  we  need  only  consider  the  /  =  ^/g  production  amplitude  since  A^K^  system  can 
have  only  /  =  ^/2.  It  is  important  to  observe  that  since  the  total  cross-section  for 
the  production  of  two  neutral  particles  is  proportional  to  =  |  fe|  where  k  is  the  wave 
vector  in  the  ly  K^  centre  of  mass  system ,  the  creation  amplitude  will  be  proportional 
to  k^ .  We  can  thus  write  the  creation  amplitude  for  E^K^  as  m^A;^  where  m^  is 
an  energy  independent  term  (depending  only  on  the  initial  states).  Considering 
these  two  channels  only,  the  S  matrix  can  be  written  as  follows  i^ 

'^ll  ^^12  ^1  ^^ 


S 


S21  S22  m^  Jci 


(29) 


where  i  ^  1,  2  refer  to  tt"  +  ;p  and  A^  +  K^  systems  respectively  and  Sij  refer 

to  the  transition  amplitudes  from  the  initial  state  i  to  the  final  state  j.  X  describes 

the  elastic  scattering  ofK^  by  Z^  and  it  is  assumed  that  JC  ->-  1  as  A;  ->  0.  We  can 

now  write  Si/s  as 

Sij{k)  =  s,^  +  ai^k+0{k^),  (30) 

where  s^^  is  the  value  of  S^j  at  the  threshold  and  a^j  are  k  independent  coefficients, 
at  least  near  the  threshold.  We  can  now  make  use  of  the  condition  that  /9  matrix 
is  unitary  above  the  threshold,  i.e.  SS^  =  1.  This  leads  to 
2  2 

UiSiiafi  +  Sjittii)  =  -niimf;  ZSii'^i=  -m^X.  (31) 

z  =  i  1=1 

Below  the  threshold  E^K^  channel  is  closed  and  k  becomes  imaginary,  i.e. 
k=  i  \k\.  Remembering  that  the  asymptotic  wave  functions  for  E^K^  should 
vanish  we  choose  -\-i  \k\.  Unitarity  of  8  below  the  threshold  leads  to 

i:{-Siiafi  +  sfj,aii)  =  0.  (32) 
1  =  1 

1  A.  I.  Baz  and  L.  B.  Okitn,  J.  Expt.  Theord.  Phys.  8,  526  (1959);  R.  Adair,  Phya.  Rev. 
Ill,  632  (1958). 

-  We  are  omitting  a  number  of  possible  channels  describing  processes  like  single  and 
double  pion  production  in  n~-p  coUisions. 
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Also  the  unitarity  at  the  threshold  gives  us  the  condition 

2 

Ususf'i  =  dij,  (33) 

z  =  i 

Solving  these  three  equations  we  obtain 

aij  =  imimj.  (34) 

In  terms  of  partial  waves,  this  result  can  be  expressed  as 

M-f  (E)  =  m\  (Eo)  +  \m^m^k,  (35) 

where  M\{E)  is  the  matrix  element  that  describes  the  creation  of  A^  and  K^ 
with  an  orbital  angular  momentum  I  at  an  energy  E.  EqIS  the  threshold  energy 
and  1/2  denotes  the  total  angular  momentum  available.  It  is  to  be  noted  that 
only  those  M/s  having  the  same  quantum  numbers  as  the  state  of  Z^K^  will  be 

affected.  The  differential  cross-section  — ^  {6,  E)  for  A^-K^  production  can  be 
expressed  as 

'^^'^'^^  =  \g(d,E)?  +  \H6,m\\  (36) 

where  g{6,  E)  is  the  non-spin  flip  amplitude  given  by 

g{e,E)  =  -l-2][{l-i-l)M\'-i  +  lMl-^Pi{G0se)  (37  a) 

and  h(d,  E)  is  the  spin-flip  amplitude  given  by 

h{d,  E)  =  -Jy-  2J  [^z""*  -  ^r*]  Pl^Hcosd) .  (37b) 

Here  k^  is  the  wave  vector  of  the  colliding  ji~  +  p  system.  The  polarization  of  yl^ 
is  given  by 

For  even  p^j^  only  I  =  0  will  be  affected  in  (37).  Thus 

da^{d,E)       da^{d,E,)^\k\    {-ImgiO,  E,)oc*',         E  >  E, 


dQ  dQ  ki    (      Reg{d,  Eo)oc*;         E  <  E^ 

Uefid,  Eo)(x*',         E>Eo 


PA0,E)  =  PA6,Eo)+^ 


Imfid,  Eo)oc*',         E<E, 


(39  b) 


with 


EPCR 


h{Q,  Eo)-iP{d,Eo)g{0,  Eo; 


dQ 
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For  odd  />^|  v,  only  /  =  1  will  be  affected  in  (37).  We  therefore  have 

do,,{e.E)       do^{e,Eoi^\k\   \-lml{0,Eo)oc*;         E  >  E^ 

+    IT      \  T^        ,  ,/.       t:,    V         *  7.T  TH  l*Ua> 


with 


where 


dQ  dQ  h,    \      R?l{0,Eo)(x*;         E  <  Eq 

\k\    lRefAd,EQ)(x*\         E>Eq 

/.(e.^o)=-'^-^«>7^/;ff"^^'>.  (400 

d<y^  (ff,  Jb^o) 
dQ 

m(d,  E^)  =  h{e,  Eo)  COS0  —  g{e,  E^)  sin0  (41a) 

1{Q,  Eo)  =  g(d,  Eo)  COS©  +  h(e,  Eq)  sin©  (41  b) 

We  thus  see  that  near  Eq  ,  the  cross-section  and  polarization  are  linfear  functions 
of  1^1  and  are  singular  at  the  threshold.  The  singularity  may  be  a  peak,  a  dip 
or  a  step.  The  width  of  the  singularity  corresponds  to  \k\</bi„.  If  we  assume 
that  only  8  and  P  waves  are  present  in  the  A^  and  K^  production,  then  g  and  h  do 
not  contain  higher  powers  of  cos0  and  sin0  than  the  linear  term.  We  see  from 
(39a)  and  (40a)  that  the  singularity  in  the  cross-section  is  determined  by  g{d) 
for  even  p^^  and  by  g{d)  cosO  +  h(6)  siaO  for  odd  Paz-  Thus  in  the  first  case, 
the  cross-section  does  not  have  a  cos^  6  dependence,  which  it  has  in  the  second 
case.  A  refined  analysis  of  the  angular  distribution  can  thus  decide  Pai:-  Figure  47 
describes  the  qualitative  results  of  the  above  discussion.  Turning  now  to  the 
experimental  data^,  we  indeed  find  that  a  a  (E)  rises  from  the  threshold  to  a  peak 
of  about  0.8  mb  near  970  MeV  when  the  kinetic  energy  of  the  pion  is  near 
970  MeV  and  then  drops  again  to  0.2-0.3  mb.  As  remarked  before  the  angular 
distribution  is  nearly  energy  independent  and  is  strongly  peaked  forward. 
Captures  at  rest  of  polarized  Z~  by  protons^  can  lead  to  polarized  A^,  i.e.  in 

!--{-  p  -^A'^+n.  (42) 

As  for  the  capture  at  rest  of  K'  by  p,  it  mainly  takes  place  from  the  S  orbits. 
Using  the  now  familiar  arguments,  we  can  easily  see  that  the  polarization  vec- 
tors Pa  and  p^:  ^re  identical  for  even  Pai:  while  Pa  will  have  a  component  along 
its  direction  of  motion  for  odd  Pa£-  Thus  the  angular  dependence  of  Pa  can 
yield  Pai:-  The  main  difficulty  is  the  production  of  polarized  i7°'s. 
Another  possibility  is  to  study  the  decays  of  polarized  Z^^  i.e. 

Z^-^A^  +  y.  (43) 

1  See  J.  Steinberg ER,  Int.  Conf.  on  High  Energy  Physics,  CERN,  Ed.  B.  Feretti  (1958). 

2  A.  Pais  and  S.  B.  Treiman,  Phys.  Rev.  109,  1759  (1958). 

^  G.  Feldman  and  T.  Fulton,  Nuclear  Phys.  8,  1  (1958).  See  also:  A.  P.  Balachandran 
and  N.  R.  Ranganathan,  Nuovo  Cim.  16,  1142  (1960). 
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The  transition  matrix  can  be  written  as 

T  =  A(j  '  {e  X  p^q)  p^^  =  even 

T=A'{a'e)  pA2:=odd,  (44) 

where  b  is  the  polarization  of  the  y-ray  and  p^o  is  a  unit  vector  in  the  direction 
of  the  A^  momentum  in  the  rest  system  of  Z^.  As  before,  the  polarization  of  A^ 
can  be  calculated.  We  have 

JP/i»  =  - !>£  +  2  f  X  Pao  {P2:  •  (e  X  Pao)}  ;  Pai:  -  even 

(45) 
-  -p2:+2eip^-e)',  p^^^odd. 

In  the  first  case,  p^o  has  a  component  perpendicular  to  the  photon  polarization 
and  in  the  second  case  it  has  only  a  component  parallel  to  it.  Thus  for  polarized  U^ 
decays,  we  have  a  test  which  can  serve  to  distinguish  between  the  two  possibilities. 
It  has  also  been  noted  that  the  branching  ratio  for  the  electromagnetic 
processes  ^ 

Z^^A^  +  e^  +  e- 

^yl«  +  7  (46) 

depends  upon  Pai:-  Assuming  a  local  hyperon  electromagnetic  interaction,  the 
ratio  to  the  first  order  is 

^  =  -jg^     for  even     Pa^: 

J  (47) 

^  =  ~IQ^     for  odd       p^^. 

Since  the  values  differ  by  12  per  cent,  we  can  hope  to  determine  Pa^:  by  an  experi- 
mental study  of  g.  The  capture  of  I!~  by  He*  can  also  lead  to  a  determination 

oipAs^- 

i;±  +  He4->^He^  +  j7;±.  (48) 

Assuming  spin  '^/^  for  ^He^,  this  problem  is  essentially  the  same  as  the  associated 
production  in  jv-p  colhsion  to  decide  p^y  If  ^^  has  a  polarization  jPj:  the  dif- 
ferential cross-section  can  be  written  as 


do 
dQ 


do 


dQ 


a  ±  Pi: -Pa),  (49) 

where  p    is  the  polarization  of  ^He^  and  (  -T7y  I  is  the  differential  cross-section 

when  |)2;o=  0  i.e.  for  unpolarized  2^'s.  As  before  the  plus  or  minus  signs  refer  to 
even  and  odd  Pas  respectively. 


1  G.  Feinberg,  Phys.  Rev.  109,  1019  (1958). 

2  A.  SiRLiN  and  R.  Spitzer,  Phys.  Rev.  Lett.  3,  1 10  (1959). 
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BelcUiiye  parity  pys-   Pn^  f^i^  be  studied  directly  by  the  reaction 

7C-  +  p->S^-^  K^  +  K^,  (50) 

In  this  process  the  two  final  A'-mesons  will  always  exist  in  a  state  of  even  parity. 
Thus  studying  the  energy  dependence  of  the  cross-section  and  angular  distri- 
bution we  can  determine  the  p^fs  rather  neatly.  However  very  few  cascade  events 
have  been  observed  so  far  to  permit  any  possibility  of  determining  p^^  now.  Per- 
haps we  can  also  draw  conclusions  regarding  p^s  from  the  study  of  the  absorption 
in  hydrogen,  deuterium  and  helium  bubble  chambers  of  cascade  particles.^ 


1  S.  B.  Treiman,  Phya.  Rev.  118,  366  (1959). 


CHAPTER  IX 

SYMMETRIES  AND  STRONG  INTERACTIONS 

1.  INTRODUCTION 

The  concept  of  an  interaction  in  quantum  field  theory  is  based  on  the  inclusion, 
in  the  Lagrangian  or  the  Hamiltonian  of  the  system  of  particles,  terms  coupling 
the  free  particle  fields.  The  entire  Hamiltonian  hence  consists  of  two  parts,  one 
representing  the  sum  of  the  uncoupled  or  free-field  terms  corresponding  to  the 
various  fields  and  the  other  consisting  of  the  interaction  terms.  Such  a  division 
is  not  only  meaningful  but  leads  to  quantitative  predictions  if  perturbative 
expansions  are  valid.  Since  the  field  variables  are  expressed  in  terms  of  an- 
nihilation and  creation  operators  and  space-time  wave  functions,  the  interaction 
term  in  a  local  field  theory  will  have  the  same  space-time  arguments  in  all  the 
fields  contained  in  it  and  thus  will  describe  the  annihilation  and  creation  of 
particles  at  a  space-time  vertex.  In  a  perturbative  expansion  it  is  assumed  that 
only  a  few  vertices  are  relevant  and  integration  is  performed  over  the  space-time 
arguments  of  these  vertices. 

Even  if  a  perturbative  approach  is  not  possible  as  in  the  case  of  the  strong 
interactions,  a  knowledge  of  the  exact  form  of  the  interaction  Hamiltonian  is 
still  useful,  at  least  for  a  partial  understanding  of  the  experimental  data  and  in 
predicting  a  few  results.  The  exact  form  of  the  interaction  Hamiltonian  will 
depend  on  the  space-time  and  the  intrinsic  properties  of  the  free  fields  and  also 
on  certain  a  priori  principles  of  invariances  which  are  believed  to  hold  true  in 
a  "reasonable"  description  of  Nature. 

In  the  theory  of  elementary  particles  we  are  hampered  by  the  fact  that  we  do 
not  have  field  equations  for  particles  other  than  the  electron,  photon  and 
neutrino.  The  ^-meson  can  be  treated  as  a  heavy  electron  while  the  nucleons 
are  assumed  to  be  Dirac  particles  under  certain  simple  conditions.  In  the 
case  of  the  strange  particles  even  these  assumptions  cannot  be  made.  However 
it  is  still  believed  to  be  possible  to  devise  interaction  terms  for  the  strange  particles 
without  any  detailed  assumptions  regarding  their  fields  except  for  the  inclusion 
of  their  intrinsic  attributes  like  spin,  strangeness,  isotopic  spin  and  parity  to 
account  for  the  well-known  conservation  laws.  This  approach  has  led  to  the 
d'Espagnat-Prentki^  Lagrangian  for  the  strong  interactions  of  the  elementary 
particles.  But  this  Lagrangian  involves  a  multiplicity  of  coupling  constants  which 


1  B.  d'Espagnat  and  J.  Prentki,  Progress  in  Elementary  Particle  Physics  and  Cosmic 
Ray  Physics,  Vol.  4,  p.  1,  North  Holland  Publishing  Company,  Amsterdam  (1958).  This 
article  also  contains  an  extensive  bibliography  on  this  subject.  See  also,  P.  Roman,  Theory 
of  Elementary  Particles,  North  Holland  (1960). 
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are  to  be  determined  from  experimental  data.  As  remarked  earlier,  this  is  rendered 
practically  impossible  in  view  of  om*  inability  to  compute  the  consequences  of 
any  strong  interaction  term. 

A  reduction  in  the  number  of  coupling  constants  is  thus  highly  desirable.  This  is 
achieved  by  building  into  the  Lagrangian  certain  a  priori  internal  symmetries  be- 
tween the  interactions  of  the  baryons  and  the  bosons.  In  such  an  approach,  a 
hierarchy  of  strong  interactions  with  different  symmetry  properties  are  introduced. 
It  is  usually  postulated  that  when  the  interaction  with  highest  symmetry,  usually 
taken  to  be  the  symmetry  of  the  four-dimensional  rotation  group,  is  alone  present 
the  baryons  and  sometimes  the  bosons  form  degenerate  fields.  The  observed 
multiplet  structure  of  the  baryons  and  the  bosons  are  the  result  of  the  perturbation 
of  the  original  symmetry  by  the  interactions  of  lower  symmetry.  These  ideas  have 
led  to  quite  a  few  models  of  strong  interactions  like  the  global  symmetry  model 
of  Sch winger  and  Gell-Mann.  the  cosmic  symmetry  model  of  Sakurai  and  certain 
other  models  due  to  Pais,  Salam  and  Polkinghorne,  Dallaporta,  Feynman  and 
others. 

Such  attempts  have  been  stimulated  by  the  possibility  of  experimentally 
checking  the  occurrence  of  the  postulated  symmetries.  The  initial  symmetry 
should  be  strong  enough  so  as  not  to  be  masked  or  destroyed  completely  by  the 
interactions  of  lower  symmetries,  in  which  case  it  is  impossible  to  conclude 
whether  the  initial  symmetry  is  realized  or  not.  Indeed,  at  present  the  K-  -  p 
interaction  data  suggest  that  the  global  symmetry  of  the  pion  couplings  to  the 
baryons  is  not  a  useful  concept,  since  the  iT-baryon  interactions  cannot  be  treated 
as  a  perturbation  in  view  of  the  large  cross-section  for  the  absorption  process, 
as  pointed  out  by  Dalitz  and  Tuan.^ 

2.  MODELS  OF  STRONG  INTERACTIONS 

The  d'Espagnat-Prentki  scheme  ^ 

The  first  complete  interaction  Hamiltonian  describing  these  strong  interactions 
was  due  to  d'Espagnat  and  Prentki  who  made  the  following  plausible  assump- 
tions, (a)  only  Yukawa  couplings  exist,  i.e.  the  interaction  is  linear  in  the  boson 
field,  (b)  the  total  charge  Q  and  the  total  baryon  number  N  are  conserved, 
(c)  the  interaction  is  charge  independent  which  implies  that  the  interaction 
behaves  like  a  scalar  under  arbitrary  rotations  in  the  three-dimensional  isospin 
space  and  (d)  the  interactions  also  conserve  strangeness. 

On  these  assumptions,  d'Espagnat  and  Prentki  were  able  to  show  that  the 
conservation  of  charge  is  equivalent  to  the  requirement  of  invariance  imder  a 
reflection  with  respect  to  the  plane  perpendicular  to  the  third  axis  in  the  isotopic 
spin  space  (i.e.  the  0:^-0:2  plane).  To  see  this  we  must  first  study  the  transformation 
of  an  isospinor  y)  under  a  reflection.  Let  /  be  the  unitary  operator  which  trans- 

1  R.  H.  Dalitz  and  S.  F.  Tuan,  Ann.  Phys.  10,  307  (1960). 

2  B.  d'Espagkat  and  J.  Prentki,  loc.  cit. 
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forms  ^  to  ^'  =  Iipl-^  =  Ay)  under  inversion.  Since  we  know  that  fry)  behaves 
like  a  vector  under  rotations  in  isospace,  we  have  now 

ipA-^rAyj  ==  ±  fry).  (1) 

The  plus  or  minus  sign  indicates  that  y)T^  can  either  be  a  pseudo  or  a  proper 
vector.  However  we  note  that  there  is  no  A  which  anticommutes  with  all  the  t 
matrices.  Further  a  unit  matrix  commutes  with  all  the  r  matrices.  Thus  ipry)  is 
a  pseudo  vector  and  A  must  be  of  the  form  A  =  k[I].  If  we  recall  our  discussion 
on  the  behaviour  of  spinors  imder  space  inversion  we  note  that  k  =^  ±  I,  ±i. 
But  the  cases  k  =  ±1  will  not  correspond  to  reflections.^  Thus  under  inversion 
we  have 


^  ~>y) 


±iy). 


(2) 


The  two  kinds  of  isospinors  are  respectively  termed  the  isospinors  of  the  first  and 
the  second  kind  and  are  denoted  by  ^  and  r}.  The  bilinear  co variants  that  can 
be  formed  from  ^  and  rj  are  given  below  in  the  table.  In  columns  1  and  2,  the 
two  spinors  may  refer  to  different  fields. 


Quantity 

1 

2 

3 

Isoscalars 

1^1 

rfr] 

Ir^n 

Isopseudoscalars 

|T2^ 

n-^iV 

^'r, 

Isovectors 

^T2T| 

rir^TTj 

$'rrj 

Isopseudovectors 

rrl 

ri^TTi 

^r^-cn 

If  we  denote  by  u  the  isoparity  quantum  number,  then  we  define  the  general 
formula  for  any  operator  y) 

Iy)I-^  =  y)'  ^  {±ify)  (3) 

with  I*  =  0      for  isoscalar 

=  ±2  isopseudoscalar 

=  ±2  isovector 

=  0      isopseudovector 

=  +  1  for  ^ 

=  —  1  for  ri 

where  /  is  the  inversion  operator.  Further  if  we  put  /  in  the  form 

then  the  eigenvalues  of  U  will  give  us  the  isofermion  numbers  if  we  call  ^  the 
isofermion  and  r]  the  anti-isofermion. 

Let  us  now  assume  that  strong  interactions  are  invariant  under  both 
rotations  and  reflections  in  isospace.  This  implies  that  for  the  strong  interactions 
both  /  and  U  are  constants  of  motion.  Since  the  electric  charge  is  also  a  constant 


^  In  the  ordinary  space  inversion,  the  matrix  of  transformation  is  ky^^  and  so  even  k  =  ±\ 
correspond  to  reflections. 
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of  motion,  let  us  assume  that  we  can  have  a  relation  between  the  charge  Q,  U 
and  /  of  the  form 

Q  =  ocI^  +  pU.  (6) 

Applying  this  relation  to  the  nucleon  doublet  we  find  a  =  1,  /S  =  ^/j  provided 
we  take  the  nucleon  to  be  an  isospinor  of  the  first  kind,  i.e. 

Q=I^+UI2.  (6) 

This  expression  has  an  elegant  geometric  interpretation.  We  recognize  that  a 
reflection  in  the  x^x^  plane  is  equivalent  to  the  combined  operation  of  an 
inversion  followed  by  a  rotation  of  n  about  the  third  axis  in  the  isospace.  The 
corresponding  operator  is 

Thus  the  invariance  under  R  will  automatically  lead  to  charge  conservation. 

The  above  expression  (7)  helps  us  to  fix  the  isotopic  spin  properties  of  the  strange 
particles.  The 27 particle  should  only  form  an  isopseudovector,  i.e.  U  =0  and  not 
an  isovector,  for  the  non-zero  value  of  U  (i.e.  ±2)  will  not  yield  Q=  ±1,0. 
We  can  also  conclude  that  N  and  K  are  isospinors  of  the  first  kind,  A  an  iso- 
scalar,  E  an  isospinor  of  the  second  kind,  and  jt  and  Z  isopseudovectors. 

Having  determined  the  transformation  properties  of  the  various  particles,  let 
us  now  proceed  to  construct  the  interaction  Hamiltonian  which  should  be  a  pure 
scalar  in  isospace  because  of  the  simultaneous  demand  of  invariance  under  re- 
flections and  rotations  in  isospace.  To  conserve  parity,  the  interaction  should  also 
be  a  scalar  in  the  ordinary  space.  In  the  following  let  us  assume  that  we  have 
chosen  the  correct  y-matrices  (denoted  by  7^)  to  satisfy  the  latter  demand  and 
concentrate  our  attention  on  the  needs  of  the  first  condition. 

(a)  The  tz-N  coupling:  This  is  seen  to  be^ 

(b)  The  coupling  AZtz:  This  can  be  written  as 

It  is  to  be  noted  that  since  A  is  an  isoscalar,  it  is  not  possible  to  form  either 
a  pseudo vector  or  spinor  with  it.  Thus  there  can  he  no  AAtz  interactions. 

(c)  The  i7i7jr  coupling :  Since  all  the  three  fields  are  isopseudovectors,  the  iso- 
scalar takes  the  form  _ 

G^iirp^r  X  ipj:)'7t. 

(d)  The  SSn  coupling:  This  case  is  similar  to  (a)  and  hence  the  isoscalar  is 

G^ifpstFrps-  71. 


1  We  will  be  using  the  particle  symbol  for  the  field  whenever  such  a  notation  is  found 
more  convenient. 
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Turning  now  to  iT-baryon  couplings  we  have  the  following  isoscalars^: 

(a)  ANK:  F^ip^F'tpAK  +  h- c 

(b)  ENK:  F^xp^rK  •  Fj^e  +h-c 

(c)  ASK:    F^ipArxpsr^K  +  h-c 

(d)  ISK:  F^^^r^sT^rK+h'C 

Thus  the  total  interaction  Hamiltonian  is 

=  G^iip-^  Ft tp^  Jt  +  G^i {ipA  r \p^ ct  -\-  h  '  c) 
+  i  G^{\p2: r X  \p2:)  3t  +  O^i  rp^Frxp^ ot 
+  F^iy}^ Fyj^K  +  h  '  c)  +  F^iy^^^  r  K  F \p^  +  h  •  c) 
+  F^(ipAFy)s'^2K  +  h'C) 
+  F^{^sFipsT^rK  +  h'C).  (8) 

P  is  ^5  in  the  case  of  H^  except  for  ZAti.  However /'=  1  or  75  in  Hj^  depending 
or  the  relative  parity  of  ^-mesons.  All  the  coupling  constants,  i.e.  eight  in 
number,  have  to  be  determined  from  experiments.  It  is  quite  obvious  that 
this  will  be  a  hopeless  task  to  attempt,  because  of  the  variety  of  the  couplings. 
Also,  we  cannot  treat  these  interactions  as  perturbations  because  of  their  strength. 
In  view  of  these  difficulties  theorists  have  made  another  approach  to  the  under- 
standing of  the  strong  interactions  by  attempting  to  study  the  symmetries  of  the 
system  composed  of  baryons,  pions  and  iT- mesons  from  a  priori  considerations 
and  using  these  to  construct  the  interaction  Hamiltonian.  These  symmetries  will 
relate  different  parts  of  the  system  and  hence  may  reduce  the  number  of  the  coupling 
constants.  All  these  investigations  are  based  on  the  common  belief  that  the  inter- 
actions can  be  classified  according  to  the  degree  of  symmetry  they  exhibit.  It  is 
further  assumed  that  when  the  interaction  possessing  the  highest  symmetry  alone 
is  present,  the  masses  of  the  baryons  and  perhaps  of  the  mesons  are  equal. 
The  observed  mass  differences  are  due  to  the  perturbations  by  interactions 
having  lower  symmetry  than  the  initial  one.  A  typical  example  is  the  mass  dif- 
ference between  the  neutron  and  proton  which  presumably  arises  due  to  the 
electromagnetic  interaction  which  has  no  charge  symmetry.  However  the  pion- 
nucleon  interaction  which  is  charge  independent  does  not  distinguish  between  the 
neutron  and  proton.  We  can  show  that  we  need  at  least  three  coupling  constants 
to  describe  the  strong  interactions.  Let  us  assume  that  both  the  n-  and  iT-baryon 
interactions  are  highly  symmetric,  i.e.  the  pion-baryon  and  iT-baryon  interactions 
can  be  described  through  two  coupling  constants.  To  account  for  the  baryonic 
mass  differences,  we  need  one  more  interaction  which  can  serve  as  a  perturbation . 
Since  it  is  difficult  to  postulate  this,  attempts  have  been  made  assuming  symmetry 
only  of  either  the  71:  couplings  (global)  or  the  K  couplings  (cosmic). 


1  t/5  refers  to  the  transposition  of  the  isotopic  spin  indices  only. 
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Global  and  cosmic  symmetries 

Sch winger  made  the  first  attempt  at  such  a  theory  of  strong  interactions.^ 
The  baryons  possess  three  types  of  charges,  the  nucleonic  charge  or  baryon  number 
N,  hypercharge  U  and  electric  charge  Q.  Among  the  bosons  the  ^'s  are  charac- 
terized by  Q  and  U  while  the  pions  possess  only  the  electric  charge.  Thus  we  are 
justified  in  regarding  the  nucleonic  charge  as  the  source  of  both  the  K  and  n  inter- 
actions, N  and  U  causing  the  ct  interactions  and  Q  inducing  the  electromagnetic 
interactions. 

It  is  possible  to  look  from  a  fundamental  point  of  view  at  the  hypercharge  U 
and  isotopic  spin/,  by  introducing  a  four-dimensional  isospace  instead  of  the  usual 
three-dimensional  one.  We  will  be  dealing  with  the  rotations  in  such  a  four- 
dimensional  isospace  later.  Let  it  be  assumed  for  the  present  that  we  need  four 
quantum  numbers  for  a  complete  specification  of  a  state  in  this  space  and  they 
are  the  isotopic  spin  /,  its  third  component  /g,  the  hypercharge  U  and  its  third 
component  U^ .  The  quantum  numbers  of  the  baryons  are  given  below. 


u 

u. 

/ 

Particle 

i 

V2 

V'2 

-(;)• 

i 

-V2 

V2 

-'(::•) 

0 

0 

0 

A 

0 

0 

1 

IJiU^  Z' 2-) 

When  all  the  interactions  are  switched  off,  all  the  baryons  are  assumed  to  be 
degenerate  and  to  possess  the  same  mass.  Since  the  K  interaction  depends  on  U 
and  /,  introduction  of  this  interaction  will  induce  a  mass  splitting  between  the 
two  groups  N  and  S  and  A  and  Z.  Turning  now  to  jr  interactions,  we  can  see 
that  the  coupling  of  pions  to  the  baryons  arise  both  from  the  nucleonic  charge, 
the  coupling  constant  being  gr^,  and  from  the  hypercharge,  the  coupling 
constant  being  Gfj.  Since  for  the  nucleons  both  U  and  N  are  of  the  same  sign,  the 
self  masses  caused  by  both  these  couplings  add,  while  for  the  S  these  two  contri- 
butions are  of  opposite  sign  as  N  and  U^  are  of  opposite  sign.  Thus  the  pion 
couplings  serve  to  remove  the  degeneracy  between  the  N  and  S.  Schwinger 
also  introduced  a,  KKjz  coupling,  making  use  of  the  parity  doublet  theory  which 
conserves  parity.  This  coupling  introduces  a  strong  /  dependent  coupling  between 
the  nucleons  and  iT-mesons.  Since  we  have  A ->  N  +  K  and  Z  ->  N  +  K  we  can 
see  that  the  pion  couplings  will  also  split  the  Z  and  A  masses.  However  this  KKn 
coupling  has  lost  much  of  its  appeal  with  the  overthrow  of  parity  conservation 
which  rendered  the  parity  doublet  theory  rather  academic.   As  we  shall  see 


1  J.  Schwinger,  Phys.  Rev.  104,  1164  (1956). 
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presently,  the  KKn  coupling  has  been  revived  by  introducing  an  odd  relative 
parity  betweeen  K^  and  K^. 

Sch winger^  in  a  recent  contribution  has  studied  the  problem  of  the  structure 
of  interactions  between  the  elementary  particles  more  fundamentally.  We  shall 
content  ourselves  here  with  giving  the  main  ideas  of  the  theory  without  going 
into  the  details.  The  general  basis  of  the  theory  is  that  as  we  proceed  from  the 
famiUar  weak  interactions  to  the  electromagnetic  interactions  and  then  to  others 
of  increasing  strength  we  always  reduce  the  multiplets  occuring  at  each  stage  to 
a  simpler  single  particle.  Or  in  other  words  the  inherent  multiplicity  of  this  par- 
ticle is  revealed  as  the  weaker  interactions  of  lower  symmetry  spoil  the  original 
higher  sjmimetry. 

According  to  Schwinger  the  elementary  particles  are  the  realizations  of  the 
four-dimensional  .internal  symmetry  in  terms  of  three  dimensions.  To  see  the 

viv  —  1) 
meaning  of  this,  we  note  that  a  space  of  dimension  i;  will  contain anti- 
symmetric matrices.  All  these  matrices  will  have  eigenvalues  ±1,0.  The  eigen- 
value 0  is  not  possible  in  the  two-dimensional  case.  For  v=Z,  we  have  three  anti- 
symmetric matrices  which  correspond  to  the  angular  momentum  operator  of  eigen- 
value unity.  These  matrices  give  us  the/  =^  1  representation  of  three-dimensional 
rotations.  However  for  ?;  =  4  we  have  six  antisymmetric  matrices  which  can  be 
interpreted  as  the  transformations  of  a  vector  quantity  having  four  components. 
If  we  look  at  this  space  as  (3  +  1)  dimensional,  we  obtain  the  7=1  and  /  =  0 
representations  of  the  three-dimensional  rotations.  But  we  can  form  linear  com- 
binations of  these  six  antisymmetric  matrices  as  follows.  If  I^^  denotes  the  anti- 
symmetric matrix  corresponding  to  rotation  in  the  ij  plane,  we  have 


/3==  1(^2  +  ^34) 
^3=   2  (-^12  —  -^34) 


(9) 


These  /g  and  K^  have  eigenvalues  ±  ^/g,  i.e.  these  correspond  to  spinor  represen- 
tations. In  this  approach  we  look  upon  the  four-dimensional  rotation  as  a  direct 
product  of  two  three-dimensional  rotations.  Thus  we  arrive  at  the  important 
result  that  the  six  fundamental  rotation  matrices  can  be  regarded  in  terms  of 
three  dimensions  either  as  the  direct  product  of  /  ==  1  and  7=0  representation 
or  as  the  direct  product  of  two  /  =  J  representations.  It  is  further  natural  to 
suppose  that  there  exist  fields  which  realize  these  equivalent  representations. 
The  distinction  between  the  integral  and  half-integral  /  representations  arise 
when  the  underlying  four-dimensional  symmetry  is  reduced  to  three-dimensional 
ones. 

As  a  clue  to  the  understanding  of  this  reduction  of  the  symmetry  let  us  turn 
to  the  well-known  breakdown  of  the  three-dimensional  isotopic  spin  symmetry  by 
the  electromagnetic  interactions  which  distinguish  the  x-^  —  iCg  plane  among  the 
three  planes.  The  remaining  two-dimensional  symmetry  leads  to  the  conservation 


1  J.  Schwinger,  Ann.  Phys.  2,  407  (1957). 
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of  the  charge.  Hence  it  is  natural  to  identify  /,2  with  Q.  However  in  terms  of  the 
three-dimensional  representation,  Q  will  appear  in  different  ways  according  to 
the  mode  of  breaking  up  the  four  dimensions  to  three.  For  /  =  \,  Q  =  I^^.  But 
for  /  -^  V2  representations,  /12  --  I^  +  K^.  Thus  in  general  we  can  write 
Q-^  I^+  \Y,  where  7  =  0  if  /  is  integral  and  Y  -  ±  1  if  /  =  1/2- 
The  baryons,  iT-mesons  and  pions  are  realized  as  follows: 


/ 

^ 

Particle 

V. 

1 

N 

V2 

-1 

E 

1 

0 

Z 

0 

0 

A 

V2 

1 

K\K^ 

V2 

-1 

K-,K^ 

1 

0 

n 

0 

0 

<T(?) 

It  might  be  quite  possible  that  Nature  does  not  realize  the  /  =  (0,  1)  representation 
for  bosons.  Notice  that  the  ti's  can  be  accounted  for  by  means  of  a  self-dual 
antisymmetric  tensor  which  possesses  only  three  independent  components.  This 
provides  a  way  of  realizing  the  /  =  1  representation  without  the  accompanjdng 
1  =  0  representation. 

We  have  thus  obtained  the  highest  symmetry  realized  in  nature,  i.e.  the  four- 
dimensional  symmetry  of  the  rotation  group.  The  next  task  is  to  split  the  initial 
multi-degenerate  single  fermion  field  and  a  single  boson  field  into  its  components 
which  describe  the  family  of  elementary  parties  as  they  are  observed.  This  must 
arise  due  to  a  sequence  of  interactions  of  successively  lower  symmetry. 

Sch winger  has  constructed  a  jr-baryon  interaction  which  is  highly  symmetric, 
i.e.  which  does  not  distinguish  between  /  =  1/2  and  /  =  0,  1  representations.^ 
The  nucleonic  charge  or  the  baryon  number  of  the  heavy  fermion  field  arises 
naturally  and  the  pion  field  is  dynamically  dependent  on  this  charge.  Such  a  tt 
interaction  is  said  to  possess  "global  symmetry".  At  this  stage  the  fermion  field 
possesses  4x16  components.  The  factor  four  arises  due  to  space-time  properties 
and  the  sixteen  "internal"  components  are  the  direct  product  of  the  four  com- 
ponents for  iso topic  spin  properties,  two  components  for  nucleonic  charge  pro- 
perties and  two  because  of  the  two  ways  of  realizing  the  three-dimensional  isospin 
space  from  the  four- dimensional  one. 

This  global  symmetry  of  jt-baryon  coupling  is  broken  by  if-meson  couplings 
which  possess  only  three-dimensional  symmetry.  The  degeneracy  between  the 
N  and  S  is  removed  by  assuming  different  parities,  i.e.  N  and  S  have  opposite 
parities  while  the  degeneracy  between  Z  and  A  vanishes  because  of  the  different 


J.  SCHWTNGER,    loc.  cit. 
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four-dimensional  symmetries  possessed  by  n  and  K  interactions  with  these 
particles. 

These  ideas  have  been  also  extended  to  the  lepton  family.  The  ^-meson  mystery, 
i.e.  its  large  mass,  is  supposed  to  be  explained  by  postulating  a  new  particle  a 
and  an  interaction  between  g  and  the  ^-meson.  A  family  of  bosons  with  7=1 
and  with  the  photon  as  its  neutral  member  exists.  The  charged  bosons  may  carry 
mass,  as  all  the  electrically  charged  members  of  the  lepton  family  do,  and  are 
expected  to  mediate  the  weak  interactions.  If  they  do,  their  mass  will  be  very 
large  and  the  origin  of  this  large  mass  will  again  be  a  problem. 

The  universal  7r-baryon  coupling  has  also  been  advocated  by  Gell-Mann  on 
phenomenological  grounds. ^  He  has  argued  from  the  available  data  on  photo- 
production  of  iC-mesons  that  jr-baryon  couplings  are  stronger  than  iT-baryon 
couplings.  From  this  view  point  it  is  justifiable  to  treat  the  iT-meson  interactions 
as  a  perturbation  on  the  initial  ji-baryon  couplings.  It  is  also  natural  to  presume 
that  the  yr-baryon  coupling  should  have  an  identical  form  also  for  all  the 
baryons.  This  is  not  possible  if  we  treat  i7  and  yl  as  an  isovector  and  an  isoscalar 
respectively.  So  instead,  two  new  isodoublets  are  introduced  besides  the  nucleon 
and  the  cascade  particle.  The  new  particles  Y^  and  Z^  are  defined  as 

YO^—iA"-!")  (10a) 

Zo^^iAo+S")  (10b) 

These  new  "mixture"  particles  are  linear  combinations  of  the  observed  yl^  and  27^ 
particles.  It  is  assumed  that  (i7+  Y^)  and  {Z^E~)  are  isodoublets.  There  is  of  course 
freedom  to  associate  either  Y^  or  Z^  with  Z^  or  27",  a  circumstance  which  has 
been  exploited  in  some  of  the  models  for  strong  interactions.  For  convenience 
of  reference  in  later  discussions,  let  us  collect  the  four  doublets  below. 


M:y  -.=(p)^  --(;>  --(i-) 


11) 


The  TT-baryon  interaction  takes  the  form 

+  iG,N,ry,N,^:jt.  (12) 

Gell-Mann  has  postulated  that  G^^  G^^  Gq=  G^,  i.e.  the  pions  interact  with  all 
the  baryons  in  the  same  way  and  with  the  same  strength.  This  implies  that  when 
no  other  interaction  is  present,  it  is  not  possible  to  distinguish  between  the  dif- 
ferent baryons  by  yr-meson  interactions,  i.e.  all  the  baryons  are  degenerate  when 
the  TT-baryon  interaction  alone  is  present.  Regarding  the  iT-baryon  interaction, 
no  assumption  is  made.  This  model  is  useful  in  the  sense  that  it  allows  the 


M.  Gell-Mann,  Phys.  Rev.  106,  1296  (1957). 
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K-meson  int<?raction8  to  be  treated  in  a  perturbative  manner.  However,  we  shall 
see  later  this  global  symmetry  of  jr-baryon  coupling  has  not  been  borne  out  by 
experimental  evidence,  particularly  from  K'-p  interactions. 

Sakurai^  has  developed  a  different  model  of  strong  interactions  in  which  the 
K-ha.Tyon  coupling  has  a  higher  sjrmmetry  than  the  Ti-baryon  coupling.  He  has 
pointed  out  that  the  conclusion  that  A'-meson  coupling  is  weaker  than  jr-meson 
coupling  is  not  justified  in  view  of  the  smaller  Compton  wavelength  of  the  K- 
meson.  He  is  in  disagreement  with  the  view  that  the  stronger  couplings  are  more 
symmetric  than  the  weaker  ones. 

Though  the  pion-baryon  interaction  allows  transfer  of  electric  charge  between 
bosons  and  fermions  yet  the  interactions  are  charge  independent,  while  the  electro- 
magnetic interactions  mediated  by  neutral  quanta,  i.e.  photons,  are  not  so  sym- 
metric. This  idea  is  applied  to  the  A-meson  interactions  which  allow  a  transfer 
of  the  hjT)ercharge  while  the  pion  interactions  do  not.  Thus  it  should  be  expected, 
if  the  analogy  were  correct,  that  the  K-meson  interactions  should  be  more  sym- 
metric than  the  n  couplings.  Also  this  high  symmetry  should  be  broken  by  the 
jr  coupling.  The  iT-meson-baryon  interactions  obtained  in  this  model  are  said  to 
possess  "cosmic  symmetry".  The  interaction  is  given  by 

[K]  =  pF[N^  'N^K^+  K\  .  N^K^  ±  {N,  -N^K^-N^^  N^K^)]  +  h'C.     (13) 

In  this  it  has  been  assumed  that  [N,  A,  K]  and  [N,I!,  K]  couplings  are  of  the  same 
sign  and  similarly  also  the  couplings  of  A  and  Z  with  the  S.  However  we  could 
have  chosen  them  to  be  of  opposite  sign.  Then  the  interaction  becomes 

[K]  =  pF[N,  -N'^K^^  N, .  N'^K^  ±  {N,  •N^.K^-N,-  N'^KM  +  h-c     (14) 
where  /   V'+  \  /_  yo\ 

These  interactions  are  obviously  charge  independent.  It  can  be  verified  that  the 
interactions  are  also  invariant  under  the  three  transformations: 


a) 

N.XTN,: 

Kt:>r, 

b) 

N^^-N^, 

N,-^N, 

c) 

-V.SriV^; 

K*:*K*; 

Invariance  under  (a)  simply  expresses  that  A-meson  couplings  are  hypercharge- 
symmetric;  while  the  invariance  under  (b)  expresses  that  K  couplings  do  not 
distinguish  between  the  two  baryon  doublets  having  zero  hyper  charge.  Invariance 
under  (c)  guarantees  the  hypercharge  independence  of  iT-meson-baryon  inter- 
actions, i.e.  they  are  independent  of  the  hypercharge  of  the  baryons. 

Since  the  K  couplings  exhibit  very  high  symmetry,  the  baryons  are  necessarily 
degenerate  when  these  couplings  are  present.  This  symmetry  needs  to  be  broken 


1  J.  J.  Sakurai,  Phys.  Rev.  113,  1679  (1959). 
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by  the  jr-baryon  couplings.  Sakurai  has  suggested  a  plausible  method  of  break- 
down of  the  cosmic  symmetry.  The  electromagnetic  interactions  while  breaking 
the  three-dimensional  isotopic  symmetry  pick  out  only  the  +  1  eigenstate  of  Tg , 
i.e.  only  the  proton  has  electromagneti  interaction  (ignoring  of  course  the  anom- 
alous moment  interaction.)  In  the  weak  interactions  it  is  (1  +  y^)'(p  that  is 
retained.  A  similar  idea  can  be  pursued  to  account  for  the  large  mass  difference 
among  the  baryons,  i.e.  N-S  mass  difference.  If  we  add  the  two  symmetric 
interactions  {N^rN-i^ -i-  N^rN^)  -  :nc  and  (NitN-i-N^^rN^)  •  ^  the  symmetry 
Nj_  '^  N^  is  broken  and  in  the  result  we  have  null  :^-N^  coupling.  An  identical 
approach  to  destroy  the  symmetry  between  iVg  ^^^  ^z  i^  <^f  ^^  avail  as  we  require 
a  triplet  and  a  singlet  instead  of  two  doublets.  Further  since  the  A-Z  mass 
difference  is  rather  small,  we  can  perhaps  infer  that  the  coupling  constant  for 
[AZn]  and  [HZtz]  must  be  of  the  same  order.  However  if  we  choose  the  couplings 
to  be  of  the  same  sign  the  original  symmetry  still  persists.  So  let  us  choose 
^Ai:n  =  — ^Ezn-  The  n  interaction  with  the  A  and  E  can  now  be  written  as 

oc[Ai:7i\  +  [EE7i\:=IxE'7i-A'i:'7t-E'A'7i 

ocN^xN^'Ti  +  N^xN^'n.  (16) 

As  regards  f/  =  0  baryons,  the  tz-  and  iT-meson  interactions  are  symmetric  but 
in  a  different  way.  The  mass  difference  arises  only  when  the  7r-nucleon  interaction 
is  switched  on.  The  pion-baryon  interactions  in  the  model  discussed  here  are 
characterized  by  two  coupling  constants.  Sakurai  has  also  suggested  that  the  n-N 
coupling  will  be  twice  0^2:n  or  ^esii- 

An  alternative  view  regarding  the  cosmic  symmetry  of  K  interactions  is  pos- 
sible.^ We  observe  that  in  the  electromagnetic  interactions,  the  total  rest  mass 
of  the  fermion  minus  the  total  rest  mass  of  the  antifermions  is  a  constant  of 
motion,  while  in  weak  interactions  this  conservation  law  of  fermion  mass  is 
violated.  Let  us  take  this  fact  seriously  and  draw  the  following  analogy  between 
the  TT-baryon  and  the  electromagnetic  interactions.  The  jr-baryon  interaction 
which  allows  a  change  of  fermion  charge  is  also  charge  independent  while  the 
electromagnetic  interactions  are  not  so.  So  correspondingly  we  can  expect  inter- 
actions which  allow  a  transfer  of  a  mass  between  fermions  should  not  depend  on 
the  mass  state  of  the  fermion.  Since  ^-mesons  couple  different  baryons,  we  should 
naturally  expect  that  iC-mesons  should  be  symmetrically  coupled  to  all  the 
baryons,  i.e.  we  obtain  the  cosmic  symmetry  model  oiK  interactions.  Assuming 
charge  independence  of  7r-baryon  interactions,  we  see  that  except  for  E-A-tc 
coupUng,  the  ji's  connect  the  same  mass  states  of  the  fermions.  Our  principle 
suggests  that  we  can  perhaps  put  the  E-A-n  coupling  equal  to  zero. 

These  considerations  suggest  a  model  of  strong  interaction  with  cosmic  sym- 
metry for  iT-baryon  interactions  while  we  need  the  three  coupUng  constants 
G^ ,  O^  and  G^  for  jr-baryon  interaction  where  G^ ,  G2  and  G^  correspond  to  G^ , 


^  A.  Ramakrishnan,  A.  P.  Balachandran,  and  N.  R.  Ranganathan,  Proc.  Ind.  Acad. 
Sci.  53,  1  (1961). 
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O2  and  (?3  of  the  d'Espagnat  and  Prentki  Lagrangian.  As  the  model  contains  four 
coupling  constants,  it  will  not  lead  to  any  result  in  contradiction  with  the  ex- 
periment (see  later  our  discussion  of  Pais'  work).  To  reduce  still  further  the 
number  of  the  coupling  constants,  we  may  draw  the  following  analogy.  The 
symmetry  of  ;r-baryon  interaction  is  broken  down  by  the  electromagnetic  inter- 
action in  a  definite  way  depending  on  /g  of  the  particle.  The  K-meson  inter- 
actions also  connect  different  hypercharge  states  of  the  baryons  and  consequently 
they  also  exhibit  a  great  symmetry  between  the  baryons  having  different  hyper- 
charge, a  result  used  by  Sakurai.  We  now  suggest  that  this  symmetry  is  broken 
down  by  the  pion-baryon  interactions  depending  on  the  value  of  the  hyper- 
charge. Hence  we  have  0^  =  —O^;  02  =  0.  This  implies  that  the  71  interactions 
are  cyUndrically  symmetrical  about  the  third  axis  of  the  three  dimensional 
hypercharge  space.  In  this  model  the  strong  interaction  Lagrangian  is 

^nt  =  0[N,y,rN,-N,y,rN,]7z 

-{N,NjK^  +  h-c],  (17) 

where  we  have  omitted  a  factor  F  in  the  K  interaction  which  can  be  1  or  iy^ 
depending  on  the  relative  parity  of  K.  As  such,  the  model  is  highly  sjmimetrical 
and  will  lead  to  results  in  contradiction  with  the  experiments.  To  remove  Z-A 
degeneracy,  we  can  set 

This  implies  that  we  use  the  two  new  doublets  N2  and  iV^3  for  the  interaction  of 
iT-meson  with  S  where 


^^'^-(-""^o)  K-(-ll 


(21) 


This  model  suffers  from  a  serious  draw  back  in  that  it  cannot  remove  N  —  S 
degeneracy. 

Boublet  approximation 

The  approximation  of  Gell-Mann,  introduced  by  means  of  the  "mixture 
particles"  involves  the  neglect  of  the  mass  difference  between  the  A^  and  Z^. 
This  scheme  of  grouping  the  eight  baryons  into  four  doublets  is  called  the  doublet 
approximation.  We  shall  digress  a  little  on  the  symmetries  of  this  scheme,  a  study 
of  which  will  be  useful  for  our  later  discussions.^ 

In  Fig.  48  we  have  placed  the  baryons  at  the  vertices.  The  pion,  neutral  and 
charged  iC-meson  transitions  are  indicated  by  drawing  horizontal,  vertical  and 
diagonal  lines  connecting  the  appropriate  pairs  of  baryons. 


1  Dallaporta,  Nuovo  Cim.  7,  200  (1957).  See  also  the  lecture  notes  on  "Strong  Inter- 
actions of  Strange  Particles  and  their  Symmetries"  by  N.  Dallaporta,  Tata  Institute  of 
Fundamental  Research,  Bombay  (1959). 
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We  have  treated  the  pair  of  baryons  connected  by  pions  as  isodoublets.  It  is 
natural  to  treat  the  baryons  connected  by  charged  and  neutral  ^-mesons  also  as 


Fig.  48.  Doublet  approximation. 

doublets  in  a  different  sense.  Accordingly  we  will  call  the  baryons  connected  by 
neutral  iT-mesons  as  strangeness  doublets  and  distinguish  them  by  the  third 
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component  of  a  strangeness  vector  cOg,  assuming  values  +  ^/a  and  —'^U. 
For  baryons  connected  by  charged  ^-mesons,  we  shall  use  a  quantum  number 
called  isostrangeness  spin  C3  which  takes  +^/2  or  — '^j^.  These  quantum  numbers 
of  the  various  baryons  are  listed  below.  Assuming  that  these  quantum  numbers 
are  conserved  it  is  easy  to  find  the  appropriate  quantum  numbers  of  the  bosons. 

BPCR      22 


338  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

The  most  important  feature  of  the  table  is  that  K*a  are  isosinglets,  i.e.  no  longer 
do  iC"^,  K^  form  an  isodoublet.  From  the  table  we  find  that  there  exist  two 
relations 

0  =  T3  +  Cs  (19a) 

U  =  (o^+  Cz  (19b) 

according  to  which  we  should  have  only  three  independent  conservation  laws 
apart  from  the  law  of  conservation  of  baryons.  However  in  the  Gell-Mann- 
Nishijima  scheme  we  have  apparently  two  conservation  laws,  i.e.  conservation  of 
Q  and  U.  But  it  should  be  not«d  that  there  exists  an  arbitrariness  in  assuming 
that  X+  has  positive  hypercharge  and  R-  has  negative  hypercharge  which  is 
removed  here  because  of  the  three  conservation  laws. 

In  this  scheme  there  exists  besides  the  usual  charge  conjugation  two  other 
types  of  conjugation.  The  antibaryon  is  obtained  from  the  baryon  by  charge 
conjugation  which  reverses  the  signs  of  Q,  U  and  N  which  implies  the  reversal  of 
signs  of  T3 ,  a>3 ,  .and  ^3 .  We  can  also  perform  this  simultaneous  reversal  of  Q, 
U  and  N  in  two  steps.^  (a)  Reversal  of  N  and  (b)  the  reversal  of  Q  and  U.  To 
comprehend  the  meaning  of  this  we  realize  that  we  can  go  from  p  to  ^  by  reaching 
S~  through  (b)  and  then  p  through  (a),  a  procedure  which  may  be  justified  as  follows. 
In  the  case  of  electrons,  the  wave  equation  satisfied  by  the  positron  is  obtained 
by  reversing  the  charge  and  replacing  the  wave  function  by  the  charge  conjugated 
wave  function.  However  the  heavy  fermions  seem  to  be  characterized  also  by  the 
baryon  number  and  hence  the  equation  satisfied  by  them  may  differ  from  the  Dirac 
equation  of  the  electron  in  that  it  may  contain  some  more  terms  involving  the 
nucleonic  charge  N.  Thus  it  is  plausible  that  we  should  obtain  the  antiparticle  wave 
equation  by  reversing  both  the  charge  and  the  baryon  number.  Since  Q  and  N 
are  independent  it  is  possible  to  reverse  them  separately  and  obtain  two  types 
of  conjugation  leading  to  different  particles. 

To  impart  further  physical  meaning  to  these  ideas,  let  us  imagine  that  there 

exists  a  fundamental  baryon  B^  with  N  =  +1  and  Q  =  U  =  O.To  get  the  baryons 

i\\  and  N^ ,  we  first  "clothe"  B^  with  K's  and  ^'s  and  then  subsequently  with  71's. 

On  clothing  these  again  with  K's  and  K'8,  we  obtain  N^  and  N^ .  The  procedure 

is  listed  below. 

B^  +  K'^-^p';  B^  +  K^->n' 

^  B^  +  K^-^S^';         B^  +  K--^S-'  •  ^^^^^ 

n'-\-7i^^>'P  p'  -^  n~  ->n', 

ni  -        „  '  /^  (20c) 

n  +  K^^S-  +  K^-^Y^;  n+ K' ^E'  +  K^-^I- 

In  this  picture  any  baryon  possesses  a  "spinor  core"  B^  (source  of  the  nucleonic 
charge  N)  and  its  Q  and  U  arise  from  the  meson  cloud.  We  can  define  a  "spinor 
conjugation"  which  reverses  the  nucleonic  charge  alone  without  affecting  Q  and  V 

1  P.  BuDiNi,  N.  Dallaporta  and  L.  Fonda,  Nuovo  Cim.  9,  316  (1958). 
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and  a  "boson  conjugation"  which  reverses  only  the  Q  and  U  of  the  surrounding 
meson  cloud.  It  can  be  seen  that  the  usual  charge  conjugation  is  a  product  of 
these  two  conjugations  and  that  the  two  faces  of  the  cube  of  Fig.  48,  i.e. 
pnE^Z^  and  Y^E'E^E'  are  related  by  boson  conjugation.  Further,  since  boson 
conjugation  connects  N-^  and  N^ ,  in  variance  under  it  demands  the  equality  of  tz 
and  K  couplings  with  these  doublets.  Invariance  under  the  spinor  conjugation  is 
also  guaranteed  provided  the  strong  interactions  are  invariant  under  charge 
conjugation.  It  is  necessary  in  order  to  split  the  N^-N^  degeneracy  to  introduce 
interactions  which  will  violate  separate  invariance  of  boson  conjugation  and  spinor 
conjugation  though  still  invariant  under  charge  conjugation. 

Pais  was  the  first  to  point  out  that  the  above  schemes  of  strong  interactions 
with  high  symmetries  are  not  warranted  by  the  experimental  evidence.^  For 
we  shall  see  presently  that  these  interactions  forbid  reactions  which  actually  do 
occur.  Let  us  write  down  the  interaction  Lagrangian  assuming  the  doublet 
approximation 

F^N^rN^K^-F^N\rN^K^  +  h'C  (21) 

Avhere  F  =^  I  or  iy^  depending  upon  the  parity  of  K+  and  K^.  To  satisfy  charge 
independence  with  the  customary  isospin  assignments  to  the  baryons  or,  in  other 
words,  to  obtain  the  d'Espagnat-Prentki  interaction  Lagrangian  from  this,  we 
have  to  impose  the  equality  of  the  coupling  constants  of  27  and  A  with  the  tz-  and 
iT- mesons.  Hence  we  have 

G,^G,=  G;  F,:=F,  =  Fj^;  F,=  F,=  Fj^.  (22) 

It  can  be  shown  that  if  the  doublet  approximation  holds  good  then  the  following 
reactions  are  forbidden. 

K"-  +  n->K°  +  p 

K-  +  p->K°  +  n 
ji^-fp->E^  +  K' 
K-  +  p->E^  ^71- 

The  doublet  approximation  guarantees  the  separate  conservation  of  i^ ,  CO3  and  ^3 
while  in  none  of  the  above  processes  are  these  quantities  conserved,  even  though 
Q  and  U  are  conserved.^  We  note  that  the  interaction  Lagrangian  (21)  with  the 
condition  given  by  (22)  is  invariant  under  the  transformation 

N^  ^  ^^3;    K+  ^  K^;    ^0  ->  K^ 

1  A.  Pais,  Phys.  Rev.  110,  574  (1958). 

2  Pais  uses  the  quantum  numbers  8^  and  S^  which  can  be  related  to  0)3  and  C3  as 

a  N  ^  ^  N 


(23) 
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iVj  — *  N^  implies  the  transformation  F®  — ♦27".  Let  us  consider  now  the  reaction 
Invariance  under  these  transformations  permit  us  to  write 

<i:-K-^\7t-p>  =  -]/2<2yK^\7i-py, 

^(r-i.^)=2-^-(roA-o).  (24) 

This  result  implies  not  only  the  equality  of  cross-section  for  Zy^L^  production  but 
also  an  identical  angular  distribution  of  the  charged  and  neutral  27's.  However 
it  is  well  known  that  charged  hyperons  tend  to  have  a  forward  peaking  while 
the  neutral  hyperons  have  a  backward  peaking.  Thus  we  have  shown  that  the 
doublet  approximation  fares  very  badly  and  the  strong  interactions  cannot  be 
described  with  these  coupling  constants  alone. 

Pais  modeP 

To  overcome  these  difficulties  Pais  suggested  that  the  relative  parity  p(K) 
between  K^  and  K^  can  be  odd.  This  is  possible  for  under  the  doublet  approxi- 
mation K^  and  K^  belong  to  different  isogroups.  If  ^1+  and  K^  form  an  isodoublet, 
the  relative  parity  has  necessarily  to  be  even.  Let  us  also  assume  that  the  relative 
parity  of  K^  and  A  is  even.  Then  the  K  interactions  become 

[K]  =  F^[N\iy,N^K^  +  N.N^K^]  +  Fi^[N^N,K^  -  N,(iy,)N,K^]  +  h'C    (25) 

With  this  assumption  of  odd  p{K),  a  significant  result  emerges,  i.e.  to  preserve 
charge  independence  in  pion-nucleon  interactions  the  equality  of  27  andyl  couplings 
with  71-  and  iiT-mesons  given  by  (22)  becomes  a  necessary  condition.  To  understand 
this,  let  us  calculate  the  contribution  of  if  interactions  to  the  self  masses  of  ^and^i. 
In  the  self  energy  diagrams  the  intermediate  state  for  p  involves  27+  and  K^  or  Z^ 
and  K^  while  for  the  neutron  it  will  contain  a  Y^  and  K^  or  27~  and  K^.  The  K'^ 
has  a  scalar  interaction  and  K^  has  a  pseudoscalar  interaction.  If  we  denote 
by  a  and  b  ,  the  scalar  and  pseudoscalar  matrix  elements  respectively  since  the 
contribution  to  p  and  n  should  be  equal  due  to  charge  independence,  we  find 

Fja^  +  Fjjb^  =  FJia^-\-Fjb^, 
{F}-Fh){a^-b^)  =  0, 
Fi  =  ±Fn     for     a=^b.  (27) 

If  we  assume  p{K)  to  be  even  we  have  a  =  b  and  thus  no  such  condition  emerges 
in  that  case. 

We  have  already  seen  that  the  relation  (22)  is  not  in  conformity  with  experiment. 
We  must  introduce  an  interaction  which  will  violate  the  separate  conservation 


A.  Pats,  Phys.  Rev.  112,  624  (1958). 
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of  Tg,  (Og  and  (^3.  An  interaction  of  this  type  was  suggested  by  Pais  and  involves 
the  K Kn  couphng.^  It  is  represented  by 

It  will  conserve  parity  if  ^(iC)  is  odd.  It  can  be  verified  that  it  does  not  conserve 
any  of  the  quantities  T3,  cog  and  ^g  and  is  not  charge  independent.  The  intro- 
duction of  this  interaction  will  allow  all  the  previously  forbidden  reactions. 

Through  this  interaction,  some  features  of  the  strange  particle  phenomenon 
like  scattering  of  iT-mesons  and  associated  production  can  be  qualitatively  under- 
stood. It  is  obvious  from  the  structure  of  the  interaction  that  while  charge  ex- 
change scattering  involves  an  exchange  of  a  single  pion,  the  direct  scattering 
involves  an  exchange  of  two  pions.  For  charge  exchange  scattering  the  virtual  71 

gives  a  factor  -^r— -jr—r-  where  oc  =  2p^lml  with  p  the  momentum  of 

[1  +  c<;(l  —  COS0)]2  /-  /     TT  r 

the  'incoming"  K^ .  This  factor  implies  a  forward  peaking  in  the  charge  exchange 
cross -section.  For  heavy  nuclei  there  will  be  a  suppression  of  this  forward  peaking 
due  to  the  Pauli  principle.  Pais  has  found  that  for  the  best  fit  with  the  experi- 
mental data  on  total  cross-section,  G^jj^^e,  the  unit  of  electric  charge  (with 
h  =  c  =  I).  The  direct  scattering  of  K^  can  be  treated  as  a  TriV^  scattering  with 
an  effective  iT^  7^2  interaction.  In  the  energy  range  100-200  Me V,  the  ji-N 
scattering  is  predominantly  through  /  =  ^/g  state  and  hence  it  is  to  be  expected 
that  the  ratio  of  K+  +  p  to  K+  +  n  scattering  will  be  9  :  1 .  Experimentally  we 
find  that  there  is  an  appreciable  difference  between  the  cross-sections  for  K^  +  p 
and  K^  +  n  direct  scatterings. 

This  theory  affords  a  plausible  explanation  of  the  well-known  forward  peaking 
of  charged  hyperons  and  backward  -peaking  of  neutral  hyperons  in  the  centre 
of  mass  system.  The  neutral  hyperon  production  occurs  through  3b  7i~-p  scat- 
tering followed  hj  KKtz  and  iT-baryon  interactions,  i.e.  the  scattered  n~  emits 
a  iT"  which  is  absorbed  by  p  which  turns  into  a  A^  or  E^,  or  the  scattered  p  can  emit 
a  K^  which  absorbs  7i~  becoming  a  K^.  Making  use  of  the  fact  that  {ji'-p) 
scattering  shows  a  very  marked  forward  peaking  in  the  centre  of  mass  system 
due  to  shadow  scattering,  we  can  understand  the  backward  scattering  of  neutral 
hyperons.  A  similar  procedure  can  explain  the  forward  peaking  of  i7+  hyperon, 
noting  thati7+  production  vertex  is  pseudoscalar.  Thus  on  evaluating  the  matrix 
element,  this  imparts  a  minus  sign  and  hence  gives  a  forward  peaking  oiE^.  It  is 
seen  that  Z~  production  is  not  possible  through  Gj^ij. 

The  experiments  on  the  angular  distribution  oi  7z~  arising  from  the  decay  of 
A^  or  27"  produced  in  7i~-p  colHsions  indicate  that  there  exists  an  "up-down" 
asymmetry  about  the  production  plane  (defined  by  the  momentum  of  the  in- 


^  A.  Pats,  loc.  cit.  Such  a  cubic  boson  interaction  like  KKn  is  seen  to  be  prohibited 
if  we  demand  in  variance  under  y^  transformation,  since  this  invariance  would  imply 
<^^ —0  simultaneously  with  ip-^y^y}.  See  A.  Ramakrishnan,  A.  P.  Balachandran  and 
N.  R.  Ranoanathan,  loc.  cit.  See  also  G.  Baym,  Phys.  Rev.  117,  886  (1960). 
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coming  n'  and  the  momentum  of  the  hyperon)  in  the  case  of /I",  and  no  asymmetry 
in  the  case  of  Z".  This  result  can  be  understood  if  it  is  assumed  that  the  produced 
A^  are  polarized  while  the  E~  are  not.^  This  assumption  is  justified  in  a  qualitative 
way  assuming  piK)  to  be  odd.  The  invariant  transition  matrix  elements  for  the 
two  production  processes  can  be  written  as 


V^.  =  A'  +  iBa^'ip^.xpjj-) 


(28) 


The  difference  in  structure  follows  from  the  fact  that  while  K^  is  a  pseudoscalar, 
/if+  is  a  scalar.  Assuming  that  A,  B,  A'  and  B'  are  real  we  have  maximum  polari- 
zation for  A^  production  and  no  polarization  in  case  of  Z~.  We  should  also  expect 
w°  to  be  polarized  as  much  as  A^. 

This  Ox  11  interaction  will  destroy  charge  independence  in  strange  particle 
reactions,  evidence  for  which  is  not  conclusive.  Besides,  it  can  in  no  way  generate 
the  baryon  mass  spectrum.  Indeed,  Pais  has  posed  the  question  whether  there 
exists  as  yet  an  unknown  mechanism  of  mass  displacement  hidden  behind  the 
actually  observed  mass  spectrum  of  the  particles. 

Salam-Polkin^horne  model  ^ 

Earlier  in  this  chapter  we  have  studied  a  model  of  strong  interactions  in  which 
we  have  made  use  of  the  symmetries  available  in  a  four-dimensional  isotopic 
spin  space.  We  will  now  refer  to  some  other  models  of  strong  interactions  of 
elementary  particles.  In  these  theories  as  in  Sch winger's  the  elementary  particles 
are  realized  as  irreducible  representations  of  the  four-dimensional  rotation  group. 

In  a  four-dimensional  real  orthogonal  space,  there  are  six  possible  rotations, 
the  infinitesimal  generators  of  which  are  4x4  matrices  I^p  where  oc,  ^  =  1,2,3,4. 
These  matrices  can  be  represented  analytically  as 

{I.f)ye=(K^S-^.,^l,y),  (29) 

where  d^y  etc.  is  a  Kronecker  delta  and  ocp  define  the  particular  matrix  while  yd 
define  the  rows  and  columns.  As  mentioned  before,  it  is  possible  to  group  these 
I^pS  such  that  we  obtain  two  distinct  three-dimensional  rotation  groups.  To  do 
this,  let  us  define 

IJ^Hh.-Iu), 

^  Though  this  explanation  suffices  for  U"  decay,  the  experimental  results  on  U'^  decay  show 
that  there  is  a  maximum  up-down  asymmetry  ia  p  +  n^  channel  while  there  is  no  asymmetry 
in  w  +  71"*"  channel.  Obviously  this  implies  that  the  27+  are  polarized  when  they  are  produced. 
A  more  satisfying  explanation  is  that  the  decay  inn  +  n^  channel  of  the  27+  and  the  n  +  7i~ 
of  the  Z~  are  parity  conserving  while  the  p  +7i^  channel  is  non-parity  conserving.  This  situation 
is  possible  provided  that  V-A  interaction  is  operative  in  p  +  n^  while  F  or  ^  alone  causes 
the  other  two  channels  (see  the  chapter  on  "Weak  Interactions"). 

2  A.  Salam  and  J.  C.  Polkinghorne,  Nuovo  Cim.  2,  685  (1955). 
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where  i,  k,l  =^  1,  2,  3  and  all  the  other  cyclic  permutations  thereof.  We  can  verify 
that  the  following  commutation  relations  hold  good. 


If  we  let 

ut,  m  = 

-It 

(31) 
(32) 

tnen 

[li,  4]  = 

il, 

[K„K^\  =  iKt          [Ki,I,]  =  0. 

(33) 

Since  I  and  K  commute  and  obey  commutation  relations  just  like  angular  momen- 
tum operators,  P  and  K^  can  be  simultaneously  diagonalized  and  the  eigenvalues 
areO,  ^/2,  1  .  .  .  .  As  usual  we  can  assume  the  third  component  of/  and  K,  i.e. /g 
and  K^  to  be  diagonal.  Any  representation  of  these  rotations  can  be  labelled 
by  a  pair  of  numbers  i  and  k  denoting  the  lengths  of  P  and  K^.  The  dimensions 
of  the  representation  will  be  given  by  (2i  +  1)(2A;  +  1). 

A  classification  of  the  elementary  particles  and  their  interactions  can  be  now 
attempted  by  realizing  the  particles  as  certain  irreducible  representations  of 
the  four-dimensional  group.  Salam  and  Polkinghorne  have  discussed  the  following 
model.i  The  A^  hyperon  is  treated  as  a  scalar,  the  i7  's  as  a  self-dual  antisymmetric 
tensor  (only  three  independent  components)  while  the  nucleons  and  cascades 
are  treated  together  as  a  four  vector  in  this  space.  The  iT-mesons  form  a  four 
vector  while  the  ji-mesons  form  a  self-dual  antisymmetric  tensor.  We  list  below 
the  quantum  numbers  involved  in  such  a  description : 


Particle 

Description 
and  symbol. 

I 

K 

h 

K, 

7l~ 

Pseudoscalar 
Isoself-dual 
tensor. 

0 

1 

0 

-1 

0 
0 
0 

K- 

[Pseudoscalar  (?) 
Isovector 

% 

% 

V 
n 

Dirac  spinor] 
{ Isovector 

V. 

V. 

-V. 

'/2 

-V2 

.40 

1  Dirac  spinor 
[Isocalar  ip^ 

0 

0 

0 

0 

2:- 

(Dirac  spinor 
llsoself-dual 
[tensor  v>f/3 

1 

0 

1 

0 

-1 

0 

0 
0 

A.  Salam  and  J.  C.  Polkinghorne,  loc.  cit. 
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It  is  seen  that  while  /  and  /g  give  the  customary  isospin  assignments  of  the  multi- 
plets,  K^  is  related  to  the  hypercharge  of  the  d'Espagnat-Prentki  scheme 
discussed  earlier  by 

We  can  also  note  that  the  charge  Q  is  given  by  Q  =  I^  -j-  K^. 

Having  classified  the  elementary  particles  let  us  study  the  symmetries  of  tlic 
interactions  between  them.*  In  accordance  with  our  general  belief  concerning 
the  symmetries  and  strengths  of  the  interactions,  it  is  natural  that  we  should 
expect  that  the  strong  interactions  are  invariant  under  all  the  rotations  in  the 
four-dimensional  space.  This  implies  that  /,  K,  /g  and  K^  are  conserved.  The 
selection  rules  for  the  strong  interactions  may  be  written  as 

J/=0;        AK=0.  (34) 

Since  the  electromagnetic  interactions  discriminate  between  the  members  of 
the  multiplets,  the  selection  rules  for  the  electromagnetic  interactions  will  be 

J/3  =  0;        AK^  =  0.  (35) 

The   weak  interactions   cannot  be   invariant   under  separate  rotations  in  the 

two  spaces  but  due  to  charge  conservation  will  conserve  /g  +  K^.  To  impart 

a  geometric  meaning  let  us  define  an  M  space  in  which  the  generators  of  rotations 

are  given  bv 

Mi  =  It  +  Ki=^I„. 

Setting  L^^  —  i  M^ ,  it  follows  that  L^  satisfy  the  angular  momentum  commu- 
tation rules.  The  eigenvalues  corresponding  to  the  operator  L^  are  given  by  L 
which  can  take  values  0,  ^/a,  1  .  .  . .  The  M  space  in  which  Jvj*  are  rotation  operators 
form  a  subspace  which  is  orthogonal  to  the  fourth  axis  of  the  isospace.  By  defi- 
nition the  range  of  eigenvalues  of  L  is  given  by 

L  =  I  +  K,    1+  K—l...\I-K\. 

It  is  now  clear  that  the  weak  interactions  should  conserve  M^,  i.e.  they  should 
be  invariant  under  rotations  about  the  third  axis  in  this  space. 

We  can  now  envisage  two  classes  of  strong  interactions,  i.e.  very  strong  (VS) 
and  medium  strong  (MS).  As  before,  let  us  demand  that  the  very  strong  inter- 
actions are  invariant  under  all  the  rotations  in  the  /  and  K  spaces  and  hence 
in  M  space.  We  will  characterize  the  (MS)  interactions  by  their  invariance  under 
all  the  rotations  in  /  space  and  also  under  rotations  around  the  third  axis  in  the  K 
space,  the  latter  being  necessary  to  conserve  Q.  The  (MS)  interactions  are  thus 
not  invariant  under  arbitrary  rotations  in  K  space  but  rotations  around  the  third 
axis  in  K  space  leave  the  (MS)  interactions  invariant.  The  lower  sjrmmetry  of 
the  (MS)  interactions  allows  us  to  resolve  the  NS  degeneracy. 


^  J.  C.  PoLKiNGHORNE,  NiLovo  Cim.  6, 864  (1957) ;  B.  d'Espagnat,  J.  Prentki  and  A.  Salam, 
Nud.  Phys.  o,  447  (1958);  B.  d'Espagnat,  and  J.  Prentki,  NucL.  Phys.  6,  596  (1968). 
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The  very  strong  interactions  should  be  scalars  in  the  four-dimensional  space 

and  are  built  up  of  baryons  and  pions.  The  interaction  Lagrangian  can  be  written 

as 

S^  ^  G,  ip^  y,  ipf  cp^p  +  G^  y)f ^  y,  y>f^  99^,  +  G^  [if>i^  y,  ^^'  cp^p  +  h-c].      (37) 

where 


Wo 


P 


ip 


i-Wp  +  Ws- 


iWp  +  Vs- 


Wn  -  Ws") 


and 


_ip 
1 


i-Wn  +  WS') 


(38) 


1/2 


(-^2: 


1 

Jp 


iw 


E  + 


W2 


We-) 


(39) 


Similar  expressions  hold  for  (pf  and  (p^^  respectively  like  \p^  and  ^f^ 
In  terms  of  the  components,  S^  becomes 

^^  ==G^i\xpNy^r\pN:!t^-fpsy^ry)s:ft\ 

+  4G2 *[iFi:75  X  tf^r]  :7r  +  4 (^3 ^ [{f?^ 75 ^^  jr  +  ^  .  c] 


(40) 


Except  for  a  reduction  of  the  coupling  constant  from  four  to  three  due  to  the 
N  —  S  degeneracy,  the  pion  interaction  is  identical  with  the  corresponding  part 
of  d'Espagnat-Prentki  Lagrangian. 

Since  the  (MS)  interactions  are  invariant  under  all  rotations  in  the  /  space 
and  under  rotations  about  the  third  axis  in  the  K  space,  it  follows  that  L^  must  be 
a  scalar  in  the  four-space  plus  the  third  component  of  a  vector  in  the  K  space. 
The  electromagnetic  interactions  which  are  invariant  under  rotations  about 
the  third  axis  in  the  usual  three  dimensional  isospace  have  a  similar  structure. 
Let  us  attempt  to  formulate  the  scalar 

^f  -  F.ipS  YsW^'^pS  +  F,ip?  y.n^fpcpf  +  h-  c  (41) 

-  F2  [Wn  t  (Pk  75  ^'2  +  ^  V^^  T  T2  99^  75  x^y^  +  h'C]. 

The  third  component  of  a  vector  in  K  space  is  easily  formed  by  inserting  between 
the  vectors  y)f  and  ^^Jthe  third  component  of  the  rotation  operator  in  K  space. 
We  thus  have 

^f  =  iK iWN 75  WaVk-^  Ws ^2 rK 75 Wn  +  ^  -  c)  (42) 

-  i  ^2  (Wn  r  cpK  75  ^'r  -iy>E^  ^2  99^  75  ^f'r  +  ^  '  c)  • 
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Sf  is  identical  with  the  part  of  d'Espagnat-Prentki  interaction  dealing  with  K- 
meson  interaction  except  that  the  coupling  constants  have  been  reduced  to  two. 
On  comparing  S*^'  and  S^' ,  the  only  difference  is  that  in  S^  the  N  and  S  interact 
mth  X-mesons  with  coupling  constants  having  opposite  sign.  S^  and  S^  by  them- 
selves  alone  are  not  sufficient  to  resolve  the  N-S  degeneracy  since  self  mass 
depends  on  the  square  of  the  coupling  constants.  If  however  we  set  the  strong  K- 
meson  Lagrangian  as 

S^=Sf+S^  (43) 

then  we  have  four  coupling  constants  and  the  interaction  is  identical  with  that 
of  the  d'Espagnatr-Prentki  Lagrangian.  This  naturally  resolves  the  N-E 
degeneracy. 

We  wish  to  stress  the  fact  that  to  resolve  N-S  degeneracy  we  must  only  treat 
the  iC-meson  interaction  as  medium  strong.  The  choice  of  very  strong  K  inter- 
actions and  medium  strong  n  interactions  will  not  split  the  N-E  degeneracy .  Of 
course  we  can  still  remove  the  degeneracy  arbitrarily  by  removing  the  coupling  of 
the  pion  to  E  (Sakurai  model). 

Analogous  to  the  electromagnetic  interactions,  the  (MS)  interactions  will 
satisfy  the  selection  rule 

|zlK|  =  0,  1        Al=0.  (44) 

It  is  possible  to  discuss  a  scheme  of  weak  interactions  by  demanding  that 
they  are  not  only  invariant  under  rotations  around  the  third  axis  of  the  if  space, 
but  also  are  invariant  under  arbitrary  rotations  in  M  space.  We  shall  not  go 
into  details  here  except  pointing  out  these  M  invariant  interactions  have  the 
property  that  they  can  give  rise  to  both  1^1  / 1  =  ^/g  and  ^/g  transitions.^  Let  us 
consider  the  decay  A^  ->p  -\-  n~.  The  initial  state  has  /  =  0,  JiT  =  0  and  X  =  0 
while  the  final  state  has  /  =  ^/g,  ^/g,  K  =  ^/g  and  iy  =  2,  1,  0.  Let  us  suppose 
that  only  weak  and  (VS)  interactions  are  allowed.  Then  zd  L  =  0,  which  implies 
that  in  the  final  state  /  =  i/g  or  |zl  / 1  =  ^/g.  The  [^  / 1  =  '/2  admixture  can  arise 
from  the  (MS)  interactions.  The  (MS)  interactions  can  enter  the  decay  of  A 
when  the  particle  splits  first  into  a  K^  and  n  while  in  the  absence  of  it  such  a 
{AN K)  vertex  cannot  occur.  The  violation  of  |/d/|  =  ■^/2  rule  will  not  be  very 
strong  because  of  the  (MS)  interactions.  It  is  to  be  noted  that  except  for  the  decay 
of  K^  into  two  pions  (though  slow  compared  to  K^)  all  the  other  non-leptonic 
decays  of  strange  particles  seem  to  obey  l/d/|  =  '^1^  rule  perfectly. 

A  scheme  for  including  leptons  in  this  four  isospace  has  been  discussed  which 
is  however  far  from  satisfactory. ^ 

Pais  has  also  studied  the  classification  of  the  symmetries  of  interactions  on  a 
formal  basis  using  a  four-dimensional  isospace.^  We  shall  not  here  deal  with  his 


^  It  should  be  remembered  that  a  four  vector  will  split  into  a  three  vector  plus  a  scalar, 
a  self  dual  antisymmetric  tensor  will  be  a  three  vector  and  a  four  scalar  will  become  three 
scalar  in  the  M  space. 

2  I.  Saavedra,  Nud.  Phya.  10,  6  (1959). 

3  A.  Pais,  Proc.  Nat.  Acad.  Sci.  40,  484  (1954). 
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first  model  as  it  gave  rise  to  many  non-observed  particles.  The  second  model^  is 
more  realistic  and  is  intimately  related  to  the  doublet  approximation.  The  quan- 
tum numbers  for  the  elementary  particles  are  listed  below. 


Particle 

I 

h 

K 

K, 

P 

Va 

V2 

0 

0 

71 

-V2 

0 

2-+ 

V2 

V2 

V2 

V2 

yo 

-V2 

V2 

Z« 

'U 

V2 

V2 

-V2 

z- 

-V2 

-V2 

E^ 

'U 

+V2 

0 

0 

E" 

-V2 

0 

n^ 

1 

0 

jr« 

1 

0 

0 

0 

n~ 

-1 

0 

K^ 
K^ 

0 

0 
0 

V2 

V2 
-V2 

K- 

0 

0 
0 

'k 

V2 
-V2 

Having  assigned  representations  to  the  particles,  the  symmetries  of  various 
interactions  are  assigned.  The  71  interactions  have  the  highest  symmetry  in  that 
they  conserve  all  the  quantum  numbers  I,I^,K  and  K^.  Due  to  the  oddness  of 
p{K)  the  introduction  of  iiT- meson  coupling  destroys  the  equivalence  between 
the  states  with  K^  =  +1^2  ^^^  K^  =  —  Fo .  For  the  ^-meson  interactions  /,  /o 


and  K^  are  conserved  but  not  K.  As  in  the  Salam-Polkinghorne  scheme, 
the  electromagnetic  interactions  conserve  I^  and  K^ .  The  K  Kn  interactions 
happen  to  be  the  weakest  among  the  class  of  the  strong  interactions  in  that  even 
separate  conservation  of  I^  and  K^  no  longer  holds  good.  Only  /g  +  iTg  is  a  good 
quantum  number. 

We  shall  now  refer  to  a  model  of  strong  interactions  in  which  all  the  baryons 
are  grouped  together  by  treating  them  as  different  ''charge"  states  of  a  single 
field  which  will  be  described  by  a  32-component  spinor  wave  function.^  As  before, 
introducing  a  four-dimensional  hypercharge  space,  we  form  a  Tr-baryon  interaction 
which  is  left  invariant  under  rotations  in  a  three-dimensional  subspace,  i.e.  the 
isotopic  spin  space.  As  in  the  Salam-Polkinghorne  model,  the  ^-baryon  inter- 
action is  a  sum  of  two  terms,  the  first  one  being  a  scalar  in  the  four-dimensional 
space,  the  second  transforming  like  the  third  component  of  a  vector  in  the  other 
three-dimensional  subspace.  The  N-S  degeneracy  is  removed  because  of  the 
difference  in  the  coupling  constants  of  K  with  N  and  S.  To  remove  the  A  and  E^ 
degeneracy,  use  is  made  of  the  freedom  in  defining  the  doublets  N^  and  N^ ,  i.e. 

1  A.  Pais,  Phys.  Rev.  112,  624  (1958). 

-  J.  TiOMNO,  Nuovo  Cim.  6,  69  (1957);  N.  Dallaporta  and  Tayoda,  NuovaCim.  14,  142 
(1959);  N.  Dallaporta  and  V.  De  Santis,  N^u)vo  Cim.  14,  225  (1959);  N.  Dallaporta  and 
L.  K.  Pandit,  Nuovo  Cim.  1^,  135  (1960). 
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we  can  exchange  Y^  and  Z^.  If  we  add  the  two  Lagrangians,  A  and  U  have 
different  coupling  constants  with  7t  and  if-mesons.  Except  for  AZtz  coupling 
constant,  all  the  other  pion-baryon  coupling  constants  satisfy  global  symmetry. 

Feynman's  modeP 

We  will  now  take  up  a  very  speculative  model  of  strong  interactions  due  to 
Feynman  who  obtains  a  relation  between  the  square  of  the  baryon  mass  M  and 
the  quantum  numbers  like  strangeness  S  and  the  total  isotopic  spin  /  which 
are  conserved  in  the  strong  interactions, 

3f2  -  a  +  6/(7  !    l)  —  cS.  (45) 

The  three  constants  are  determined  from  the  masses  ofp,  A  and  S'.  The  constants 
so  obtained  give  correctly  the  masses  of  the  Z  hyperons.  In  this  expression  we 
neglect  the  mass  diiference  within  a  multiplet.  Naturally  the  question  arises 
whether  it  is  possible  for  any  scheme  of  strong  interactions  to  produce  this 
formula. 

We  now  recognize  that  /(/  +  1)  is  the  eigenvalue  of  the  square  of  the  isotopic 
spin  vector  /  and  hence  if  the  interaction  of  a  particle  depends  upon  /,  it  should 
jdeld  such  a  term.  Obviously  a  particle  coupled  to  /  should  transform  like  an 
isotopic  vector.  Thus  it  is  suggested  that  the  pion  (isovector)  is  coupled  to  the 
total  /. 

In  the  absence  of  the  pion  coupling  A^  and  ly  have  the  same  mass  so  that  the 
/T-meson  couplings  do  not  destroy  this  degeneracy.  This  can  be  achieved  by 
\^Titing  a  K  interaction  Lagrangian  which  preserves  isotopic  spin  and  strangeness 
and  setting  F^  =  F^  ==-  Fj,  F^  =-^  F^  =  F^ .  Since  Fj  4=  F^  there  will  be  no  longer 
a  degeneracy  between  the  A^  and  the  Z^  in  the  fourth  order.  Assuming  global 
symmetry  for  pion  interactions  in  this  model  of  strong  interactions,  we  have  three 
coupling  constants  G,  Fj  and  Fjj . 

Let  us  assume  that  in  the  absence  of  these  couplings  all  the  baryons  have  equal 
masses.  In  the  absence  of  pion  couplings,  the  contribution  of  K  interactions  to 
the  self- masses  of  p  (n)  and  S-  {S^)  is  proportional  to  Ff  and  Ffj  respectively 
while  for  other  baryons  it  is  proportional  to  FJ  +  FJj .  According  to  this  we  should 
expect  that  the  square  of  the  mass  of  the  degenerate  E  and  A  should  be  the 
average  of  that  of  the  nucleon  and  the  S.  This  is  precisely  what  is  expressed 
by  the  term  — cS.  Thus  we  have  developed  a  model  of  strong  couplings  which 
can  jdeld  the  formula  given  by  (45). 

In  order  to  study  the  symmetry  properties  of  the  strong  couplings  we  suppose 
that  the  pion  couplings  are  not  strong,  i.e.  the  perturbation  treatment  of  the 
pion  couplings  is  valid.  Near  resonances  involving  pion  couplings  this  assumption 
will  break  down.  The  reason  for  regarding  pion  coupling  as  not  strong  seems  to  be 

^  R.  P.  Feynman  (unpublished).  Professor  Feynman  feels  that  the  physicist's  problem  is 
not  just  to  make  speculative  models,  but  really  to  find  out  what  is  going  on  in  nature.  In 
his  opinion,  this  model  is  a  speculation  and  almost  certainly  does  not  describe  the  observed 
reality.  However,  we  wish  to  include  it  to  demonstrate  the  gross  inadequacy  of  conventional 
ideas  to  answer  the  challenge  of  strong  interactions  and  the  necessity  for  new  ways  of  thinking. 
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favoured  because  the  A^  —  2"^  mass  difference,  caused  by  pion  couplings,  in  this 
model,  is  smaller  than  the  other  mass  difference  between  the  baryons. 

In  the  absence  of  the  pion  couplings  let  us  attempt  to  define  a  few  quantum 
numbers  to  describe  the  symmetry  of  the  system.  We  see  that  the  K-meson 
couples  p,  Z^,  E^,  E^  together  and  so  also  n,  Y^,  E~  and  S~.  These  two  sets  are 
independent.  In  each  set  we  have  two  single  independent  members  and  a  doublet. 
Let   us   call  these   quantum   numbers  by  A^  and  ^3,  which  are  listed  below. 


As 

^3 

V 

V2 

0 

2J+ 

V2 

V'a 

Z 

V2 

-V2 

z-o 

1/., 

0 

n 

-V2 

0 

y;~ 

-V2 

-V2 

yo 

-V2 

+V2 

E" 

-V-2 

0 

K- 

0 

V2 

K' 

0 

-V2 

K^ 

0 

-V2 

Ko 

0 

V2 

As  before,  we  can  interpret  A.^  and  B^  as  the  third  component  of  two  three-dimen- 
sional vectors  obtained  from  a  four  dimensional  space.  A^  is  identically  equal  to  T3 
introduced  in  the  doublet  approximation.  The  isotopic  spin  of  the  particle  is 
given  by  I  =  A  +  B.  Indeed  this  relation  suggests  a  method  of  breaking  the 
symmetry.  We  can  have  two  pions  tt^  and  71  f,  which  depend  respectively  on  A 
and  B  alone,  the  ^3  or  ^3  values  of  jt^  and  ttj  being  (1,0)  and  (0, 1)  respectively. 
It  follows  now  that  ttj  connects  only  Z~,  i7+,  Y^  and  Z^.  A  virtual  pion  can  be 
emitted  as  tt^  and  absorbed  as  jif,  thus  breaking  the  laws  of  conservation  of  A 
and  B  separately.  If  however  virtual  pion  processes  are  not  important,  we  can 
expect  A  and  B  to  be  conserved  separately.  Otherwise  only  the  total  A  ^  B, 
i.e.  isotopic  spin,  is  conserved. 

As  an  application  of  this  model  of  the  strong  interaction  let  us  consider  the 
charge  exchange  scattering  of  K^.  This  process  is  forbidden  because  of  the  failure 
of  the  conservation  of  ^3 .  However  we  note  that  there  is  large  charge  exchange 
scattering  in  the  energy  region  200-300  MeV,  though  at  low  energy  regions  there 
is  very  little  charge  exchange  scattering.  This  is  at  best  explained  by  invoking 
a  resonance  involving  virtual  pions,  in  which  case  we  have  no  separate 
conservation.   A  similar  difficulty  exists  in  explaining  K~-p  caipture  reaction. 

For  the  sake  of  completeness  we  shall  also  discuss  a  model  of  weak  interaction 
which  follows  from  this  scheme  of  strong  interactions.  As  we  have  seen  earlier 
the  weak  interaction  can  be  pictured  as  an  interaction  of  a  current  with  itself, 


i.e.  J]f,Jy,.  We  also  know  that  J^  can  be  written  as 


(46) 
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where  the  terms  have  the  previous  meaning.  In  order  that  vector  coupling  need 
not  be  renormahzed,  it  was  shown  that  J,p  should  be  coupled  to  the  operator  7+ . 
In  the  present  case  /+  =  ^4+  +  5+ .  For  simplicity  let  as  assume  J,g  to  be  coupled 
to  A^.  alone.  As  long  as  ^"'^  is  separately  conserved,  the  renormalization  of  the 
eoupUng  constant  is  not  required.  Thus  the  form  Jg^  is 

Jsc  =  (pn)  +  {£^  Y)  +  (ZE-)  +  (S^S-) .  (47) 

It  can  be  verified  that  J^^.  has  A^=  — 1  and  B^  ==■-  0,  i.e.  its  A  spin  is  1,  and  B 
spin  equals  zero. 

We  shall  now  determine  the  structure  ofJg„c  by  discussing  the  most  mysterious 
feature  of  K  particle  decays,  i.e.  the  slow  decay  of  K^  ->7i^  -\-  n^  as  compared 
t-o  the  271  decay  mode  of  the  neutral  iC-meson.  We  will  so  determine  the  structure 
of  Jgnc  such  that  K^  decay  is  allowed,  while  K^  is  forbidden  on  neglecting  the 
virtual  pions  which  is  an  interesting  feature  of  this  model.  To  see  this  we  note 
that  Jg„f.  can  consist  of  the  following  terms 


^•13 

^3 

py 

-1 

V2 

-ipz) 

0,0 

-V2 

(ni:-) 

0,0 

-V2 

(r,n) 

-1 

-V2 

where  0  indicates  that  the  term  belongs  to  a  triplet  of  ^  =  1 .  If  we  retain  only 

terms  with  ^3  =  0  for  non-leptonic  decays  of  the  strange  particles  induced  by 

JlcJgnc  +  ^  '  ^   we   will  have  the  selection  rule  AA^=  ±\.  From  the  quantum 

numbers  of  iC^  and  71^  we  see  that  the  decay  K^  ->  71^  +  7i^  is  forbidden.  If  we  have 

only  (pY),  we  can  verify  that  the  decay  K*  ->  JrJ  +  JiJ  is  allowed  because  of  the 

presence  ^  =  1  state.  But  on  adding  the  term  [E^n),  the  total  coupling  oiJgcJlnc 

has  Zl^3  =  0  with  ^  =  0  or  2  states  only.  K^  is  then  forbidden  to  decay  into 

71^  -t-  JiJ  s^nd  also  into  states  jr^  +  ^ttJ  and  71^  +  n^  because  of  Zl  ^3  =  0.  The  decay 

into  the  state  71I  -r  .t?  is  equally  prevented.  However  if  we  include  the  terms  like 

{pZ)  or  {nZ~),  these  decays  become  possible.  Thus  we  determine  the  structure 

of  J,„^  as 

J,.„=(vZ)+(i:*n).  (48) 


s  nc 


Adding  the  corresponding  terms  with  S  we  have 

■J>nc  =  ipy)  +  {S*  »)  +  [^  ^-)  +  (Z.  3-) .  (49) 

The  main  drawback  of  this  is  the  existence  of  terms  with  strangeness  —  1  and  +  1 
which  implies  that  we  can  have  decays  like  AS  =  2  for  which  there  is  very  little 
evidence  as  discussed  before.  The  total  weak  interaction  takes  a  very  sjnnmetric 
and  simple  form 

J  =  v{e  ^  ,1)  -i-  ip  -r  £^){n  +  Y)  +  {S^  +  Z^){S-  +  E-) .  (50) 

This  current  with  equal  coefficients  for  J^^  and  J^^c  will  allow  a  higher  branching 
ratio  for  the  /^-decays  of  hyperons,  i.e.  E^  ->  n  +  e'^  +  v,  A^  ^^^  j>  +  e~  +  v  than 
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SO  far  observed  from  the  experiments.  Of  course  as  pointed  out  earlier,  it  is  not 
clear  whether  J^^j^  has  the  same  coupling  constant  as  the  other  parts  of  the  total 
weak  interaction  current. 

We  will  conclude  our  discussion  of  this  model  with  a  few  remarks  on  the  selec- 
tion rules.  For  the  leptonic  decays  with  the  emission  of  negative  leptons 

AA,=^+l,    AB,  =  -^I^;     |zl^|=l,     |zlB|-^V2-  (51) 

For  the  emission  of  positively  charged  leptons  we  have 

AA,^-i,   Jj53=+V2.    M^|=i;    M^HVa.  (52) 

However  it  forbids  K+  ^  /bf^  +  v  since  AA^=^0.  For  non-leptonic  decays  with 
A  S  =  -±1,  the  selection  rules  are 

AB^^-i,  AAs  =  0;         |ZlB|  =  V2;         \AA\==0     or     2.  (53) 

The  rule  ^1^83  =  — 1/2  allows  K^  decay  through  both  ^  =  0  or  2.  If  we  do  not 
allow  A  =  2  state,  then  we  have  the  usual  \AI\  =  '^j^  rule.  These  rules  for  the 
Sti;  decay  modes  for  K+  and  K^  predict  that  K^ -^  71'^  +  ji~  +  n^  is  half  as  fast 
as  ir+  ->  271+  +  TT"  while  Bn^  decay  of  K^  is  thrice  as  fast  as  ir+  ->27rO  +  71;+. 
It  is  to  be  noted  that  there  are  three  channels,  i.e.  all  the  three  pions  are  jr^^ 
Avith  ^  ==  0  or  one  jr^,  and  two  n'as  with  ^  =  0  or  2. 

We  have  presented  the  various  "wild  ideas"  on  strong  interactions  with  a 
view  to  pointing  out  the  inadequacy  of  our  knowledge  about  them.  Most  of  the 
models  are  incomplete  and  yet  we  cannot  distinguish  between  them  because 
of  our  inability  to  calculate  accurately  the  consequences  of  the  strong  interactions. 
The  search  for  more  symmetries  of  strong  couplings  seems  to  be  the  only  way 
of  understanding  the  strong  interactions.  We  can  safely  remark  that  the  laws  of 
strong  interaction  have  not  been  understood  as  yet. 

3.  COMPOUND    MODELS 

When  the  new  particles  were  being  discovered,  attempts  were  made  to  under- 
stand the  spectrum  of  the  elementary  particles  using  compound  models.  In  these 
models  some  of  the  elementary  particles  were  treated  as  fundamental  and  the 
rest  of  the  particles  were  pictured  as  bound  states  made  up  of  these  constituents. 
A  number  of  compound  models  became  possible  because  of  the  different  choices 
of  the  fundamental  constituents.  The  general  idea  underlying  the  compound 
models  is  to  use  a  minimum  number  of  fields.  Heisenberg  and  his  collaborators 
have  studied  systematically  a  model  of  elementary  particles  using  a  non-linear 
spinor  equation. 

We  shall  deal  here  with  some  of  the  phenomenological  models.  Fermi  and 
Yangi  first  proposed  that  the  pions  should  be  considered  as  the  bound  states  of 


1  E.  Fermi  and  C.  N.  Yang,  Phys.  Rev.  76,  1739  (1949). 


352  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

a  nucleon  and  an  antinucleon.  A  combination  of  a  nucleon  and  an  antinucleon 
A^ill  yield  the  following  four  states  composed  of  a  triplet  and  a  singlet,  i.e. 


pn->7t* 


triplet.  (I) 


np-^  71' 
-Tj^ipp  —  nn)->7jfi'  —  singlet 

The  last  particle  has  not  been  so  far  observed  though  it  has  been  proposed  by 
Nambu  and  others^  in  order  to  explain  some  anomalous  features  the  electro- 
magnetic structure  of  the  nucleons.  Apart  from  this  drawback,  the  main  difficulty 
of  the  picture  is  to  explain  the  large  mass  difference  between  NN  and  tt  which 
has  to  be  accounted  for  by  postulating  a  very  deep  attractive  potential  well 
between  N  and  N  to  provide  for  the  large  binding  energy.  Indeed  the  explanation 
of  large  value  of  the  binding  energy  is  a  problem  in  all  the  compound  models. 
Goldhaber  proposed  a  model  by  treating  the  nucleons,  the  iii-mesons  and  their 
antiparticles  as  fundamental.*  Since  the  hyperons  have  strangeness  —1,  it 
is  clear  that  we  should  have  an  attractive  potential  between  K  and  the  nucleons 
which  is  fortunately  true,  experimentally.  The  A  and  Z  hyperons  are  obtained  as 

(2) 
{nK-)-yZ-  (pK--nK'>)->A'> 

Since  both  the  nucleons  and  iT-mesons  are  isospinors,  the  iso topic  spin  assign- 
ments of  Z  and  A  also  follow.  Let  us  choose  an  isotopic  spin  dependent  potential 
of  the  form 

F=-Fo  +  Ti-T2Fi  •  (3) 

where  0  <  F^  <  Vq.  Since  t^  •  Ta  =  —3  for  /  =  0,  and  +1  for  /  =  + 1,  ^  has 
more  binding  energy  than  Z  and  so  has  a  lower  mass  than  Z.  It  is  pertinent  to 
point  out  that  such  a  model  of  the  A  and  Z  hyperon  cannot  account  for  the  up- 
down  asymmetry  of  the  pions  resulting  from  the  decays  of  the  hyperons.  For  we 
can  picture  the  decay  of  the  hyperons  as  resulting  from  the  decay  of  the  iT- mesons, 
either  through  27r  or  3  jr  modes,  the  excess  of  pions  being  absorbed  by  the  nucleon 
"core".  However  no  asymmetry  will  result  from  the  decay  of  a  spinless  boson 
into  spinless  bosons  for  the  conservation  of  angular  momentum  does  not  allow  a 
mixture  of  angular  momentum  partial  waves  in  the  final  state.  In  order  to  explain 
the  very  interesting  features  of  the  asymmetry  in  the  various  channels  of  i7-decay, 


^  Y.  Nambu,  in  the  Proc.  7th.  Rochester  Conference,  Interscience,  New  York  (1957). 
2  S.  Goldhaber,  Phys.  Rev.  101,  433  (1956).  See  also  Proc.  7th  Rochester  Conference, 
Interscience,  New  York  (1957). 
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Barshay  and  Schwartz  have  recently  considered  i7  as  a  compound  state  of  a 
A  and  n.^ 

To  obtain  the  cascade  particles  of  strangeness  — 2  it  is  obvious  that  we  should 
compound  the  A  on  I^  with  K  's.  When  we  do  so  we  obtain  two  isodoublets 
and  a  quadruplet  with  /  =  ^/g.  Since  only  one  doublet  has  been  observed,  i.e. 
{S~,  S^)  the  model  is  faced  with  serious  difficulties.  Absence  of  hyperons  with 
strangeness  +  1  can  be  easily  explained  by  means  of  a  repulsive  potential  between 
the  nucleons  and  ^-mesons  which  also  seems  to  be  supported  by  the  experiments 
on  K^-N  scattering. 

Another  very  interesting  model  which  has  many  attractive  features  and  which 
has  recently  received  attention  is  that  of  Sakata  which  treats  p,  n  and  A  as  fun- 
damental particles.  2  We  have  already  referred  to  the  important  features  of  this 
model  from  the  point  of  view  of  universal  Fermi  interactions.  We  now  give  the 
picture  of  elementary  particles  following  from  this  model  which  is  essentially 
an  extension  of  the  Fermi- Yang  model  with  the  addition  of  A  to  account  for  the 
strange  particles.  We  have 

NA->K;        NA->K 

(4) 
(N(NA))->Z;        {A{NA))-^S. 

There  is  a  large  fluctuation  in  binding  energy  as  successively  a  baryon  and  an 
antibaryon  are  added  as  seen  from  the  masses  of  K  and  H  particles.  Recently 
Yamaguchi^  has  proposed  a  model  of  strong  interactions  on  the  basis  of  this 
model.  Thirring*  has  studied  a  model  of  strong  interactions  employing  only  three 
fields  as  fundamental.  These  fermion  fields  are  two  component  spinor  fields. 

4.  SPACE-TIME  PROPERTIES  AND  INTERNAL  SYMMETRIES 
OF  STRONG  INTERACTIONS 

It  is  now  well  established  that  while  the  weak  interactions  do  not  conserve 
parity,  the  strong  interactions  are  invariant  under  space  reflections.  It  is  rather 
surprising  that  such  a  geometrical  invariance  should  be  at  all  related  to  the 
strengths  of  the  interactions  involved  in  a  particular  phenomenon.  However  we 
note  that  the  strongly  interacting  particles  possess  internal  degrees  of  freedom 
such  as  isospin  and  strangeness  which  the  leptons  do  not  seem  to  possess.  It  is 
also  surprising  that  the  domain  of  validity  of  the  parity  conservation  and  strange- 
ness conservation  coincide.  We  are  thus  led  to  expect  an  intimate  relation  between 
"internal"  symmetries  and  the  space-time  symmetries.  We  will  here  summarize 
some  important  investigations  based  on  this  approach.  We  will  assume  that  all 


1  S.  Barshay  and  M.  Schwartz,  Phys.  Rev.  Lett.  4,  618  (1960).  See  also  A.  P.  Balachan- 
DRAN  and  N.  R.  Ranganathan,  Nuovo  Cim.  18,  1269  (1960);  G.  Bhamathi,  S.  Indumathi, 
A.  P.  Balachandran  and  N.  G.  Deshpande,  Nuovo  Cim.  19,  190  (1061). 

2  S.  Sakata,  Progr.  Theoret.  Phys.  16,  686  (1956). 

3  Y.  Yamaguchi,  Progr.  Theoret.  Phys.  Supplement,  11,  37  (1959). 
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the  interactions  of  the  elementary  particles  are  invariant  under  CP  which  seems 
to  hold.  The  following  theorems  can  be  proved: 

(a)  Theorem  1 :  If  CP  invariance  holds  and  if  the  Yukawa  coupling  is  direct 
then  parity   must   be   conserved  for  charge  symmetric  interactions  of  the  form 

Assuming  no  parity  conservation,  the  interaction  Lagrangian  is 

/>int=  ^  r{g,  +  Qpy^)  N '  n  +  N  r{g*  -  g*  y^)  N '  jt 

-^r{g',  +  g'^y,)N:!t  (1) 

where  g'^  is  pure  real  and  g'^  is  pure  imaginary.  In  this  it  is  to  be  noted  that  the  jr 
field  is  self- conjugate,  i.e.  3t=n^ .  If  we  now  demand  CP  invariance,  we  find  that 
either  g'^  or  g'j,  should  vanish,  for  to  have  CP  invariance  all  the  coupling  constants 
should  have  the  same  phase.  To  observe  the  role  of  internal  symmetry  in  this 
theorem,  xoe  note  that  charge  symmetry  of  the  interactions  requires  that  the  coupling 
constants  characterizing  the  p-nn^  and  the  np7i~  interactions  be  the  same  not  only  in 
absolute  magnitude  but  also  in  phase  and  sign. 

We  can  also  verify  that  if  C  invariance  holds  and  if  the  n-N  interaction 
must  be  of  the  derivative  type,  then  it  must  conserve  parity  and  yr^  must  be 
pseudoscalar  provided  tiP  is  even  under  charge  conjugation.  If  we  now  further 
assume  charge  independence,  the  charged  pions  are  also  pseudoscalar,  a  result 
pointed  out  by  Sakurai.^  It  is  to  be  noted  that  tz^  is  even  under  C,  independent 
of  whether  n^  is  scalar  or  pseudoscalar.  This  will  follow  provided  that  n^  is 
spinless  and  we  allow  the  virtual  process  n^  ->  p  +  p. 

These  results  apply  equally  to  SSn  and  ZZn  interactions.  However  these 
theorems  in  no  way  help  us  to  prove  that  the  ZAn  interaction  will  be  parity 
conserving.  To  see  this,  let  us  write  a  CP  invariant  but  parity  non- conserving 
and  charge  independent  ZAn  interaction,  i.e. 

L2:An  =  -^(a  +  by^)Ajt  +  A{a--by,)  Z:t.  (2) 

It  is  easily  seen  that  charge  independence  of  the  interaction  does  not  guaraTitee 
the  invariance  of  the  Lagrangian  under  Z<-^A  interchange.  Thus  we  cannot 
obtain  any  restriction  on  the  structure  of  the  interaction.  To  obtain  such 
restrictions,  we  have  to  introduce  the  doublet  approximation  for  the  Z-A 
system  and  assume  g2:An  =^  ^EXn-  Then  we  know  from  our  previous  discussions 
that  the  n  interactions  with  Z  and  A  have  the  same  form  as  n-N  interactions. 
We  can  thus  obtain  parity  conservation  in  ZAn  interactions.^ 

Turning  now  to  K  interactions,  since  the  K  's  connect  different  baryon  multi- 
plets,  it  is  clear  that  we  cannot  exchange  the  member  of  a  doublet  with  those 
of  a  triplet.  We  can  hope  to  obtain  necessary  restrictions  for  parity  conservation 


1  G.  Feinberg,  Phys.  Rev.  108,  878  (1957).  S.  N.  Gupta,  Cam.  J.  Phys.  86,  1309  (1957); 
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on  previous  lines  only  when  we  group  all  the  baryons  into  doublets.  The  above 
discussions  on  charge  independent  n-N  interactions  clearly  show  that  for  the 
cosmic  symmetry  model  of  K  interactions  parity  will  be  conserved  provided 
the  interactions  are  CP  invariant  since  cosmic  symmetry  provides  for  hyper- 
charge  symmetry. 

To  discuss  the  case  of  K  interactions  when  no  such  symmetry  is  assumed, 
let  us  write  the  CP  invariant  non-derivative  Yukawa  iT-baryon  interactions  as 

ii..t  =  ^1  (g,  +  9p  V,)  N^  Ko  +  1%  (g*  -  g*  y,)  JV,  K"  t 

+  ^i(9i+g'py,)N:,K-  +  N,(g*'-gtY,)NiK*'> 

+  ^% (h  +  ipY.)  N, K"  +  N, if*  -  /* y,)  N, K" t 

+  iV,(/;  +  /;  y,)  N, K*  +  N,(r*  -  r*  y,)  N, ^^  t.  (3) 

Considering  the  first  two  terms  alone  in  the  above  expression  we  find  that  for  the 
first  to  be  invariant  under  CP  we  should  have 

9s  =  r]i9f,         9p-9pVi  (4) 

where  jy^  is  the  product  of  the  phase  factors  for  the  N^,  N^  and  K  fields 
under  CP,  i.e.  ^*  =  ^^^^  ^j\r,^£o .  Similar  conditions  can  be  obtained  for  the  other 
terms  and  the  interactions  can  be  written  using  these  relations 

Liut  =  gA^i  ^2  K'>  +  m  ^2  N^  ir»  t)  +  g^  (N^  y^  N^  K"  -  ti,  N^  y,  N^  K<>  t) 

+  g', {N,  iV, K*  +  rj[N, N, K^  t)  +  g;{N,  y,  iV, K*  -  r{^  N, y,  N,  K* t) 

+  fAN^N.Ro  +ri,N,N,K'>'')  +  f^{N,y,N,K-'  -  rj.N.y.N.KOi) 

+  n  (N,  N,  K*  +  ri',  N,  N,  K*  t)  +  /;  (N,  y,  N,  K*  -  r{^  N,  y,  N,  K*  t) .    (5) 

To  obtain  parity  conservation  for  K^  interactions,  let  us  define  the  "charge 
symmetry"  operation 

N^^N^        N^^N^        K^<->K''\  (6> 

In  order  to  preserve  the  terms  involving  K"^  we  see  that  K"^  should  transform  as 

where  £+  is  a  phase  factor  with  |£+  |  =  1.  Invariance  under  (6)  imphes  that 

^^9s=fsl  ^■,9p  =  fp-,         n'i  =  'n2-  (7) 

Also 

98  or  9p  =  0,         /,  or  fp  =-  0,        77i  =  ±  1,        rj2=^  ±1. 

This  K^  interaction  preserves  the  parity  quantum  number.  For  K^  interactions 
to  conserve  parity,  we  have  the  transformation 

N^^N^,        N^^N^,        K^-^K^"^  with  K^^SoK^. 

It  is  easy  to  see  as  before  that  this  will  ensure  parity  conservation  for  iC+  inter- 
actions. We  have  not  so  far  assumed  any  relation  between  the  interactions  of 
K^  and  K^  and  also  properties  like  the  relative  parity  of  K^  and  K^. 
23* 
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Though  we  have  shown  that  parity  conservation  exists  for  strong  interactions* 
we  have  yet  to  show  why  the  weak  interactions  which  have  no  internal  symmetry 
do  not  have  parity  conservation.  Sakurai^  has  pointed  out  that  if  we  impose 
in  variance  under  G  conjugation,  for  strangeness  conserving  interaction  we  have 
equal  mixtures  of  eigenspinors  of  y^  while  for  strangeness  non-conserving  inter- 
actions, G  conjugation  generates  a  charge  non- conserving  interaction  since  G 
conjugation  amounts  to  inversion  in  isospin  space.  We  have  the  interesting 
suggestion  that  the  origin  of  parity  conservation  lies  in  inversion  invariance  in 
isospin  space.  So  far  there  are  no  compelling  reasons  to  assign  isotopic  spin  to 
the  leptons,  a  fact  which  suggests  to  us  that  every  non-electromagnetic  process 
involving  leptons  is  parity  non-conserving.* 

Heisenber^'s  nonlinear  theory  of  elementary  particles^ 

From  a  survey  of  elementary  particle  interactions  it  is  clear  that  the  conven- 
tional methods  of  quantum  field  theory  of  representing  particles  by  their 
corresponding  fields  and  their  interactions  by  suitable  products  of  their  field 
operators  have  been  successful  in  explaining  in  some  measure  the  various 
phenomena,  albeit  part  by  part,  associated  with  these  particles.  These  partial 
successes  have  convinced  the  theoretical  physicists  that  there  must  be  a  "plan 
behind  the  maze" — a  plan  which  must  be  discerned  within  the  framework  of 
quantum  field  theory.  The  search  for  this  plan  has  led  the  creators  of  quantum 
field  theory  to  believe  in  a  "  unified  field  equation"  from  which  the  fields  associated 
with  all  the  particles  can  be  generated. 

The  mathematical  barriers  for  tackling  such  a  unified  equation  are  very  con- 
siderable. However  the  present  situation  demands  new  ideas,  as  in  the  days  just 
before  the  birth  of  quantum  theory.  Therefore,  Heisenberg  and  his  co-workers 
have  recently  formulated  and  developed  the  idea  of  such  a  "unified  field  equation  " 
and  pushed  it  to  the  extent  of  calculating  numerical  values  for  the  ratios  of 
masses  of  various  known  particles  and  the  coupling  constants  in  their  interactions. 
We  give  here  a  general  outline  of  their  theory  to  indicate  the  nature  of  the  attempt 
rather  than  provide  a  detailed  and  systematic  account  of  a  well  developed  theory. 

The  basic  belief  behind  this  theory  is  that  all  elementary  particles  are  stationary 
(or  quasi  stationary)  states  of  some  substance,  matter  or  energy,  since  they  can 
be  created  or  transformed  into  each  other  in  high  energy  interactions  provided 
such  transformations  do  not  violate  certain  conservation  laws.  The  distinction 
between  elementary  and  non -elementary  particles  is  just  a  convenient  convention 
and  not  fundamental.  Chew  and  Frautschi'*  have  recently  suggested  how  such  a 
*' principle  of  equivalence"  may  be  expressed  within  the  /S-matrix  framework  for 
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all  strongly  interacting  particles ;  if  supported  by  experiment,  their  ideas  might 
mean  that  the  concept  of  an  elementary  particle  would  lose  much  of  its  usefulness 
(for  details,  ref.  Appendix  14).  It  is  hoped  that  a  unified  description  of  matter  is 
finally  bound  to  be  simpler  than  the  description  of  various  particles  through  their 
individual  fields.  Such  a  unified  theory  must  be  a  spinor  theory  so  that  particles 
of  half  integral  spins  as  well  as  integral  spins  can  be  generated  out  of  it. 

In  his  attempt  to  evolve  a  unified  theory,  Heisenberg  suggested  the  following 
basic  non-linear  differential  equation  for  matter 

dw       „ 

yf^J^±i^y,.y5Viy^yf^y5'^)  =  o.  (i) 

The  y's  are  the  usual  gamma  matrices  and  I  is  a  new  independent  variable  of  the 
dimension  of  length  whose  significance  will  be  explained  shortly.  The  non-linear 
term  signifies  interaction.  In  this  theory  it  is  not  meaningful  to  ask  the  question 
how  many  fields  are  necessary  to  represent  all  the  observed  particles,  or  how 
many  particles  a  given  field  operator  can  represent.  Only  questions  regarding  the 
group  properties  of  the  field  operators  and  their  irreducible  constituent  parts 
may  be  answered  satisfactorily.  The  totality  of  the  matrix  elements  of  the  complex 
operator  ip  can  be  broken  up  into  a  series  of  irreducible  sub-parts  which  transform 
differently  under  various  transformations  and  can  therefore  be  regarded  as  the 
wave  functions  describing  many  particles,  stable  and  unstable.  To  establish  a 
connection  between  the  field  operator  ip  [x)  and  wave  functions  (p  [x)  we  must 
examine  the  matrix  elements  (p{x)  oiip{x)  (or  products  oi\p{x))  between  vacuum 
and  one  particle  states. 

We  shall  now  refer  to  the  in  variance  properties  of  equation  (1)  under  various 
transformations . 

(a)  Transformations  of  continuous  groups 

(1)  The  fundamental  equation  (1)  is  invariant  under  all  transformations  of 
the  inhomogeneous  Lorentz  group.  The  conservation  laws  for  energy,  momentum 
and  angular  momentum  consequently  follow. 

(2)  It  is  invariant  under  the  Pauli-Giirsey  transformations 

ip  -^  a*  ip  +  b*  y)"^  C  y^  (2) 

This  group  is  isomorphic  to  the  group  of  rotations  in  a  three-dimensional  space. 
Hence,  invariance  under  transformations  (2)  implies  the  conservation  of  a  quan- 
tum number  /,  analogous  to  angular  momentum  in  ordinary  space.  /  is  the  iso- 
spin  and  its  third  component  (by  convention)  is  connected  to  the  charge  Q  by 
Q  =  Iz  +  IqI^,  where  Iq  is  a  new  quantum  number  introduced  in  this  description 
of  elementary  particles. 
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(3)  Equation  (1)  is  invariant  under  the  transformations  of  the  Touschek  group, 

«;->.e~'*^»V 

(3) 

and  the  quantum  number  associated  with  it  is  I^  which  is  connected  to  the  baryon 
and  lepton  numbers  {N  and  L  respectively)  through  another  newly  introduced 
quantum  number  ly.  The  relation  is 

2 

L  =  Iy-lj^  (4) 

2 

The  two  newly  introduced  quantum  numbers  Iq  and  ly  are  connected  to  strange- 
ness S  by 

S  =  lQ-h.  (5) 

While  Iq  and  ly  may  run  through  any  positive  or  negative  integral  values,  it  is 
found  that  to  explain  observed  facts,  it  is  sufficient  if  S  takes  only  the  values  0, 
±1,  ±2.  Therefore  one  more  continuous  one  parameter  group  is  needed  to  re- 
present Iq  or  ly.  Fortunately  equation  (1)  contains  another  one-parametric 
continuous  group  since  equation  (1)  is  invariant   under   the    so-called    "scale 

transformation  " 

y)^fj'liip(xrjj  If]).  (6) 

y)  has  to  be  regarded  as  a  function  ofx  and  I  in  order  to  define  this  transformation. 
/  plays  the  role  of  an  independent  variable  and  therefore  the  totality  of  solutions 
of  equation  (1)  for  all  possible  values  of  I  must  be  considered.  Since  the  scale 
transformation  does  not  commute  with  translation,  according  to  the  usual  inter- 
pretation of  quantum  theory  we  cannot  simultaneously  specify  the  mass  eigen- 
value of  a  state  and  the  quantum  number  corresponding  to  scale  transformation. 
This,  as  stressed  by  Heisenberg,  means  that  only  the  ratios  between  the  masses 
of  elementary  particles  can  be  experimentally  tested.  In  nature  there  is  no  unit 
of  mass  or  length. 

The  scale  transformation  can  also  be  given  as 

0^y^(x,l)0-^  =  r;i^y){xri,lri).  (7) 

The  operator  O^  corresponding  to  an  infinitesimal  scale  transformation  is 

0^  =  e''^  =  {eY={l  +  e)'^  =  rj^  (8) 

where  r)  =  I  -\-  s  (e  small).  The  eigenvalues  of  yl  can  be  used  to  specify  the  quantum 
states.  This  quantum  number  must  be  identified  with  lyl2  mentioned  earlier  and 
for  particles  with  S  =  0  also  with  Zq/2.  Since  the  non-linear  equation  (1)  contains 
only  odd  integral  powers  of  y)  and  not  arbitrary  powers,  the  matrix  elements  o£yj 
should  correspond  only  to  transitions  producing  half  integral  changes  in  quantum 
numbers  of  A .  The  greatest  physical  interest  is  attached  to  AA=±i  which 
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corresponds  to  the  creation  and  destruction  of  particles  with  yl  =  ±  J  (The  states 
with  half  integral  eigenvalue  of  A  are  produced  by  applying  an  odd  number 
of  y)  operators  on  the  vacuum  and  states  with  integral  eigenvalues  by  applying 
an  even  number  of  ^'s). 

(b)  Discrete  groups 

The  only  clearly  understood  discrete  transformations  are  C,  P  and  T.  Since  yj 
contains  another  independent  variable  I,  we  can  define  another  transformation 
which  commutes  with  all  the  others  and  hence  is  suitable  for  defining  a  quantum 
number. 

Whether  strangeness  can  be  incorporated  into  the  scheme  of  a  discrete  group 
is  yet  to  be  investigated.  The  usual  charge  conjugation  C  and  parity  P  operations 
do  not  commute  with  the  transformations  of  the  Pauli-Giirsey  and  Touschek 
groups  respectively.  Therefore  these  operations  cannot  be  used  to  define  quantum 
numbers.  Heisenberg  and  his  co-workers  were  able  to  show  that  other  operations 
called  C  and  P  of  the  second  kind  can  be  defined  which  commute  with  these 
transformations.  Thus  we  can  assign  only  a  parity  of  the  second  kind  along 
with  I^.  Recently  Diirr  has  succeeded  in  connecting  the  parity  of  the  second 
kind  with  the  operation  Z-> — I  and  in  demonstrating  its  precise  conservation. 

We  have  already  mentioned  that  the  wave  functions  99  (a;)  of  individual  particles 
can  be  derived  as  the  matrix  elements  of  the  operator  ip  (x)  (or  of  products  of  such 
operators).  It  is  found  that  the  field  equation  of  particles  with  non-zero  mass 
(described  by  q)B{^))  and  having  the  full  symmetry  of  the  fundamental  field 
equation  cannot  be  a  differential  equation  of  the  first  order  (Dirac  equation).  However 
it  can  be  a  second  order  equation  like  the  Klein-Gordon  equation  for  spinor  fields. 
It  is  well-known  that  a  Klein-Gordon  equation  for  a  four- component  spinor  can 
be  put  into  the  form  of  a  Dirac  equation  by  increasing  the  components  to  eight. 
This  is  done  by  introducing  the  new  symbols 


^ji{x) 


Defining  an  eight  component  spinor 


1     d,f„ 

X  ^'  8^, 

spinor 

0=  ^ 

■  (f(x) 

it  can  be  shown  that  0  satisfies  the  first  order  differential  equation 


(9) 


(10) 


(11) 


where  /  = 


10 
01 


;^\ 


01 
10 


are  eight- by-eight  matrices.  Transforming  0  to 

<?.'  =  -^[r,  +  Z3]<p  =  i(''^"^!^)  (12) 

V2  \<Pb-<PbI 
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the  above  differential  equation  splits  up  into  two  Dirac  equations 

(y.-^  +  x)ii<PB-^^B)=^o  (13) 

A  four  component  field  function  which  satisfies  a  Klein-Gordon  equation  is  equi- 
valent to  two  Dirac  equations  of  four  component  functions  and  hence  is  sufficient 
for  describing  two  spinor  particles  (and  their  antiparticles) ;  that  is,  a  mass  doublet. 
We  can  identify  the  two  nucleons,  proton  and  neutron,  with  such  a  mass  doublet. 
In  deducing  these  wave  equations  the  following  requirements  have  to  be  con- 
sidered : 

(1)  The  wave  functions  of  the  nucleons  shall  be  positive  energy  solutions  of 
the  Dirac  equation 

(y'-^  +  x)<PN=o.  (14) 

(2)  If  the  operator  y}(x)  (and  hence  the  Klein-Gordon  function  q)^  also)  be 
subject  to  the  Pauli-Gursey  transformation  then  the  nucleon  wave  functions  q)y 
transform  among  themselves  according  to  the  usual  isotopic  spin  transformations. 

(3)  ii(pB  is  subjected  to  the  Touschek  transformation,  then  99^  shall  transform  as 

where  9'^W  ^e'"^-?-^  (15) 

/  Y  =  +  i  for  proton  and  neutron 

ly  =  — J  for  antiproton  and  antineutron. 
In  order  to  remove  the  ambiguity  in  connecting  the  Pauli-Giirsey  group  with  the 
isospin  group  (which  can  be  done  in  various  ways)  the  axes  of  rotation  must  be 
specified.  Therefore  we  have 

(4)  under  the  gauge  transformation  (pB->  ^^^^^^(Pb  ^^1  9^n  shall  transform  as 
under  a  rotation  round  the  third  isotopic  axis.  From  the  requirements  (1—4)  an 
explicit  representation  for  the  nucleons  can  be  given  as  follows: 

(Ppix)  =  il^l-—y,—jRq)Bi''^) 

(I         P   \ 
1-— n-g^]-B9'B(a:) 

=  i(i-Y)'.a^)«9'g(^)  (16) 

i-Y>''e^)-^?'§(^) 

(I  P)     \ 
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where  (f^  {x)  =  C'^  y,  ^^  (x)  =  D'^  ^^  {x) 


2 


(17) 


(1  Pi      \ 

1 yy  — —  I  all  wave  functions  here 

satisfy  the  Dirac  equation.  This  can  be  seen  thus,  q)^  satisfies  the  Klein-Gordon 
equation  so  that 


Hence 


{''i^^'-hih^-ji-i-'')''^^'-     '^'' 


(^'ir  +  x)<PN--o. 


(19) 


Thus  the  nucleon  wave  function  q)j^  transforms  like  an  isospinor  when  (p^  is 
subjected  to  transformations  of  the  Pauli-Giirsey-Touschek  group.  To  find  the 
transformation  of  99^  which  corresponds  to  parity  operation  on  cp^  we  first  note 
that  the  usual  parity  operation 

(p{r,  t)^y^(p{-r,  t) 

(p^{r,  t)-^-y,<p(-r,  t)  (20) 

does  not  commute  with  the  Touschek  transformation  with  which  the  baryonic 
number  is  associated.  Hence  this  operation  does  not  also  commute  with  the  pro- 
jection operators  in  the  expressions  for  nucleon  wave  functions  given  above. 
In  fact,  it  can  be  shown  that  the  parity  operation  of  the  first  kind  on  9?^  takes  p  ^-n. 
A  suitably  commuting  parity  operation  (of  the  second  kind)  is  defined  on  the 
energy  shell  of  the  nucleons  {pf  =  —  k^)  as  follows 

1  d 

(p{r,  ^)-^-  —  yvj^y4(pi-r,  t) 

^  "  (21) 

1  f) 

(p''{r,t)-^--y---y,cp''{-r,  t) . 

This  definition  assigns  opposite  intrinsic  parities  to  (p  and  99^.  If  we  give  opposite 
intrinsic  parities  to  neutron  and  proton  then  the  "parity  transformation  of  the 
second  kind"  on  (p^  is  equivalent  to  the  "parity  transformation  of  the  first  kind'* 
on  nucleon  wave  functions.  But  in  the  usual  theory,  protons  and  neutrons  are 
given  the  same  intrinsic  parity.  (It  may  be  pointed  out  that  the  fundamental 
equation  (1)  is  not  invariant  under  the  parity  transformation  of  the  second  kind 
if  equation  ( 1 )  is  extended  to  the  off  the  energy  shell  parts  of  ^  {x).)  But  parity  of  the 
second  kind  may,  according  to  Diirr,  be  connected  instead  with  operation  l^^—l 
under  which  equation  (1)  is  invariant. 
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The  usual  charge  conjugation  (first  kind)  on  q)^,  does  not  give  charge  conju- 
gation for  the  nucleon  wave  functions  q)jj .  The  charge  conjugation  of  the  second 
kind  defined  by 

on  the  energy  shell  of  the  nucleons  produces  the  desired  charge  conjugation  of 
the  nucleons.  The  fundamental  equation  (1)  is  invariant  under  the  "charge  con- 
jugation of  the  second  kind  "  only  if  this  operation  is  connected  with  l-^  —  lin  the 
way  suggested  by  Diirr.  It  is  interesting  to  note  that  the  combined  operation  CP 
(both  of  the  second  kind)  is  identical  with  the  combined  operation  CP  of  the  first 
kind. 

For  the  leptons  (in  the  narrow  sense  of  a  massless  electron-neutrino  doublet) 
a  parity  operation  and  charge  conjugation  of  the  second  kind  cannot  be  defined, 

1         d 
for  the  operator  (5'pl\p\  which  corresponds  to  the  operator  — y^~ —  for  the 

nucleons  now  commutes  with  the  Touschek  transformation.  On  the  other  hand, 
C,  P  and  T,  consequently  also,  CPT  remain  defined  which  is  sufficient  for  the 
description  of  experimental  facts  (see  chapter  VI). 

The  leptons  (in  the  sense  mentioned)  and  the  nucleons  can  be  brought  into 
a  single  system  by  using  the  in  variance  of  the  family  of  wave  equations  with 
parameter  I  under  the  scale  transformation. 

In  the  above  we  have  considered  at  some  length  the  wave  functions  of  the 
nucleons  which  form  the  manifestations  of  the  baryons  in  the  lower  approxi- 
mation. In  order  to  explain  the  production  of  the  strange  particles  (A^  and  E) 
which  have  spin  V2  ^^^  integral  isospin,  it  is  postulated  that  the  ground  state, 
called  the  "world"  which  forms  the  basis  for  the  existence  of  elementary  particles 
is  highly  degenerate  in  the  sense  that  it  has  a  very  high  isospin  and  hence  is  asym- 
metric. The  usual  fully  symmetric  vacuum  can  be  constructed  in  this  theory  if 
one  goes  over  by  the  transformation  xp=  (1  -\-  y^)  %  into  a  two  component  theory 
in  which  the  interaction  vanishes.  The  mass-difference  between  the  neutron  and 
proton  would  be  understandable  if  we  considered  them  as  the  two  doublet  com- 
ponents of  a  state,  "nucleon  +  world",  for  which  the  resulting  isospin  can  be 
either  parallel  or  antiparallel  with  that  of  the  "world"  which  again  carries 
an  arbitrarily  high  isospin. 

On  the  basis  of  this  interpretation,  we  can  understand  the  production  of  the 
strange  particles  A^  and  E  in  the  following  way.  By  applying  an  odd  number  of 
field  operators  on  the  ground  state  a  particle  with  half-integral  spin  is  produced. 
The  half-integral  isospin  which  is  thereby  added  to  the  ground  state  can  however 
be  so  coupled  to  the  large  isospin  of  the  world  that  the  resultant  isospin  of  the 
particle  manifests  itself  as  having  an  integral  value.  An  analogy  exists  for  this 
in  the  atomic  spectrum  of  the  calcium  atom.  The  spin  of  the  valence  electron 
appears  not  as  ^/g  (which  would  give  a  doublet  spectrum)  but  coupled  with  a 
spin  ^/2  of  the  inner  shells  so  that  a  triplet  spectrum  results  as  actually  observed. 
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Investigations  of  the  ''world"  expectation  value  of  a  product  of  two  field 
operators  have  shown  that  while  it  is  invariant  under  the  transformation  CP,  it 
is  not  invariant  under  P  and  G  separately  which  would  mean  that  a  single  strange 
particle  cannot  have  a  well-defined  parity  and  only  a  pair  of  such  particles  of 
opposite  strangeness  may  have  a  parity. 

Having  outlined  the  transformation  properties  of  the  non-linear  equation, 
we  shall  briefly  refer  to  the  method  of  deriving  mass  eigenvalues  of  elementary 
particles. 

The  principle  of  micro- causality  for  a  linear  field  theory  namely,  the  vanishing 
of  the  commutator  (or  anticommutator)  of  the  field  operators  for  the  space-like 
separation  and  its  non- vanishing  otherwise,  is  taken  over  to  the  non-linear  field 
theory  to  provide  a  method  of  quantization.  But  the  commutator  in  this  case  is 
expected  not  to  contain  the  6,  d'  singularities  on  the  light  cone  occurring  in  the 
usual  quantum  field  theory  but  to  exhibit  infinite  oscillations  there;  a  circumstance 
which  is  considered  as  a  consequence  of  the  non-linear  interaction  and  as  a 
condition  for  the  convergence  of  the  theory.  If  we  were  to  connect  the  commutators 
in  this  general  theory  with  those  corresponding  to  a  field  with  definite  mass,  by 
integration  over  a  weight  function  ^  (|)  of  the  mass,  such  a  q  should  take  negative 
values  also  since  the  d,  6'  singularities  have  to  be  weakened  on  integrating  with 
respect  to  ^.  This  impHes  the  use  of  an  indefinite  metric  in  Hilbert  space  (i.e. 
formally  negative  probabilities)  an  idea  the  depth  and  extent  of  which  is  not  yet 
fully  realized.  The  q  function  is  assumed  to  satisfy  the  conditions 

/^(l)d|^=.0. 

/^(|)|d|  =  0.  (23) 

\  Q  {^)e~  ^  d  ^  ^  1 .         (normalization) 

The  technique  of  evaluating  the  mass  eigenvalues  will  be  illustrated  here  only 
in  the  case  of  fermions.  The  fundamental  differential  equation  (1)  generates  an 
infinite  system  of  coupled  integral  equations  between  the  so-called  "t  functions". 
(The  matrix  elements  of  the  time  ordered  products  of  the  field  operators  between 
the  vacuum  and  a  one-particle  state)  connecting  t  functions  of  N  variables 
and  N  -\-2  variables.  To  obtain  an  eigenvalue  equation,  it  must  be  possible  to 
reduce  the  functions  with  a  large  number  of  variables  to  t  functions  of  a  smaller 
number  of  variables.  Therefore,  another  set  of  functions  q)  (uniquely  related 
to  the  T  functions)  is  introduced  such  that  a  Tamm-Dancoff  approximation  of 
cutting  off  the  system  of  integral  equations  at  a  finite  workable  stage,  i.e.  setting 
(pf^  to  zero  when  n  >  N  i^  meaningful.  The  finite  system  of  coupled  integral 
equations  so  obtained  defines  an  eigenvalue  problem.  The  contraction  function 
to  be  used  when  connecting  t  and  q)  functions  is  the  propagator  weighted  with 
o(^)  containing  only  one  mass  in  an  approximation  to  facilitate  the  calculation. 
If  in  general  q  (|)  is  allowed  to  contain  more  masses,  these  mass  values  are  obtained 
as  a  partly  discrete,  partly  continuous  spectrum.  For  the  calculation  of  the  masses 
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of  the  fermions,  we  start  with  the  function  of  one  variable.  Retaining  only  (p- 
functions  of  one  variable  in  the  system  of  equations  (Tamm-Dancoif  approxi- 
mation) and  setting  q>{x)  =  t{x)  (by  definition)  we  are  led  to  the  differential 
equation  of  the  form 

y,P^Qi-p^)T{x)  =  o.  (24) 

The  conditions  relating  to  the  existence  of  the  solutions  lead  to  the  search  for 
the  zeroes  of  Q{—p')  which  give  the  mass  numbers  of  the  elementary  particles. 
T  functions  of  an  odd  number  of  variables  are  used  for  calculating  the  mass  spectrum 
of  fermions  and  that  of  an  even  number  of  variables  for  bosons. 

A  similar  calculation  of  the  mass  eigenvalues  for  the  strange  particles  has  not 
been  carried  out.  It  was  found  that  an  interaction  between  the  isospin  of  a  particle 
and  of  the  ground  state  cannot  be  expressed  if  one  uses  only  the  vacuum  expec- 
tation values  of  products  of  two  field  operators  which  enter  the  calculation  of 
mass  eigenvalues  and  hence  strange  particles  cannot  appear  as  eigenvalues. 
To  obtain  their  mass  spectra  we  have  to  include  the  vacuum  expectation  value 
of  products  of  at  least  four  field  operators  of  which  not  much  is  known  (see 
chapter  V).  But  the  special  properties  of  equation  (1)  may  perhaps  make  its 
evaluation  easier  than  in  the  case  of  the  usual  local  field  theories. 


Part  Two 
COSMIC  RAYS 


'^'.  .  .  Aetherial  Dominations  who  possess, 
Elysian,  windless,  fortunate  abodes. 
Beyond  Heaven's  constellated  wilderness/ 

P.  B.  Shelley 


CHAPTER  X 

PRIMARY  COSMIC  RADIATION 

1.  PRIMARY   COSMIC   RADIATION^ 

The  first  essential  pre-requisite  to  any  study  of  cosmic  ray  phenomena  is  a 
detailed  and  accurate  knowledge  of  the  nature  and  composition  of  the  primary 
radiation.  It  is  from  such  a  knowledge  that  we  have  to  develop  a  theory  of  the 
origin  of  cosmic  rays  which  in  turn  will  throw  light  on  the  fundamental  nuclear 
processes  in  the  stellar  and  interstellar  regions.  Unless  our  knowledge  of  the  prim- 
ary radiation  is  precise,  any  theoretical  prediction  we  may  make  about  nuclear 
interactions  of  cosmic  radiation  in  its  passage  through  the  atmosphere  will  not 
be  reliable.  Hence  experimenters  have  spared  no  efiTorts  in  devising  new  tech- 
niques to  push  the  accuracy  of  our  knowledge  to  the  greatest  possible  extent. 
The  study  of  primary  cosmic  radiation  can  be  broadly  divided  into  three  parts : 
(1)  the  determination  of  the  mass  spectrum  of  the  incident  particles,  (2)  the 
energy  and  the  spatial  distributions  of  the  incident  particles  and  their  variation 
on  the  surface  of  the  earth  and  at  various  levels  of  the  atmosphere,  and  (3)  the 
temporal  variations  of  the  primary  component. 

The  most  striking  features  of  cosmic  radiation  are  the  near  constancy  of  the 
mass,  energy  and  spatial  distributions  at  any  particular  point  on  the  earth  and  the 
extreme  smallness  of  the  temporal  deviations.  Thus  any  theory  of  cosmic  rays 
and  their  origin  should  in  the  first  instance  explain  this  constancy.  However  it 
has  been  found  that  the  slight  deviations  themselves  may  serve  as  leads  in  speculat- 
ing upon  the  origin  of  cosmic  rays.  We  shall  now  deal  with  the  three  aspects  one 
after  the  other.  " 

The  mass  spectrum 

Primary  cosmic  radiation  consists  of  positively  charged  particles  which  have 
been  identified  to  be  nuclei  ranging  from  the  lightest  (hydrogen)  to  the  heaviest 
(iron).  Their  mass  spectrum  as  compared  with  the  relative  abundance  in  the 
universe  is  given  in  the  table.  The  mean  number  of  nuclei  arriving  per  square  cm 
per  unit  solid  angle  per  second  is  approximately  0.1. 


^  S.  F.  Singer,  "The  primary  cosmic  radiation  and  its  time  variations",  Progress  in 
Elementary  Particle  and  Cosmic  Ray  Physics,  Vol.  4,  p.  205  (1958).  This  article  also  contains 
an  extensive  bibliography. 
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The  general  features  of  the  mass  spectrum  are : 
(i)  protons  comprise  the  bulk  of  the  radiation, 
(ii)  a-particles  comprise  10  to  16  per  cent  of  the  radiation, 
(iii)  heavy  nuclei  do  occur  but  constitute  only  1  to  2  per  cent  of  the  total, 
(iv)  electrons  and  photons  are  almost  completely  absent  and  their  abimdance 

has  an  upper  limit  of  1  per  cent  and  0.1  per  cent  respectively. 


Nuclei 


Primary  Cosmic  Radiation^ 


Number  per 
10*  protons 


As  a  %  of 
primary 
particles 


%  of  primary 
nuclons  carried 


Universal 

abundance  per 

10*  hydrogen  atoms 


Protons  (/>) 

a  particles 

{Z  =  2) 

L-group 

(Ld,  Be,  B 

Z  =  3  to  5) 

M  group 

(C,N,0,F,Z  =  6to9) 

H-grou^ 

{Z  ^  10) 

Iron 

(Fe  Z  =  26) 

All  nuclei 

{Z  >  30) 


100,000 
10,000 

<50(?) 

520  (400) 

160 

30 

<  1 


91.5 

7.8 


0.4 
0.15 

<  0.001 


69 
23 


4.5 
3.5 


100,000 
7700 

3.6  X  10-* 

80 

30 

1.5 

10-3 


Regarding  the  neutron  composition  of  primary  radiation  the  position  is  not 
clear.  It  is  natural  to  expect  that  the  neutrons  originate  from  a  near  source  like 
the  sun  since  we  know  that  the  lifetime  of  a  neutron  is  very  short.  But  it  is  pos- 
sible for  very  high  energy  neutrons  to  have  a  life  long  enough  due  to  relativistic 
time  dilatation  so  that  it  can  be  assumed  that  they  may  have  come  from  galactic 
sources. 

The  accurate  determination  of  the  mass  spectrum  is  subject  to  considerable 
error  owing  to  the  low  intensity  of  the  heavy  primaries  and  the  uncertainty 
in  the  measurement  of  ionization.  The  heavy  nuclei  have  a  large  probability  of 
losing  energy  by  ionization  even  within  the  top  layer  of  the  atmosphere.  Although 
the  presence  of  Li,  Be  and  B  have  been  identified,  still  the  possibility  of  heavy 
nuclei  splitting  into  Li,  Be  and  B  by  collisions  with  air  molecules  remains  and  this 
makes  the  determination  of  the  mass  spectrum  very  uncertain. 


^  V.  L.  GiNZBURQ,  Progress  in  Elementary  Particle  and  Cosmic  Ray  Physics  4,  339  (1958). 
At  the  recent  Kyoto  Conference  (September  1961),  new  data  was  reported  by  M.  M.  Shapiro 
et  al.  The  principal  changes  in  the  above  table  will  be  that  the  universal  abundance  of  the 
Land  M  groups  are  6  x  10-*  and  150,  respectively,  the  corresponding  cosmic-ray  abundance 
being  110  and  400,  respectively,  i.  e.  the  presence  of  the  Li  group  is  well  established. 
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An  examination  of  the  table  leads  to  the  following  conclusions : 

(a)  There  is  considerable  doubt  as  to  the  exact  composition  of  light  elements,^ 

(b)  There  is  a  virtual  absence  of  L-flux  or  flux  of  light  nuclei  with  less  than 
0.4  m~2  sterad"^  sec~^. 

The  first  thing  that  strikes  us  about  the  charge  spectrum  is  that  it  nearly  cor- 
responds to  the  abundance  of  the  elements  in  the  universe.  But  closer  examination 
shows  significant  differences  like  the  following : 

(1)  the  presence  of  Li,  Be  and  B  in  appreciable  quantities, 

(2)  a  pronounced  peak  at  iron,  and 

(3)  the  C  :  N  :  O  ratio  is  inverse  to  that  of  cosmic  matter  which  is  1:2:6 
according  to  Suess  and  Urey.^  It  is  the  study  of  these  differences  that  will  reveal 
information  about  these  high  energy  particles  and  their  interactions.  The 
explanation  of  these  differences  lies  in  one  of  the  two  possibilities : 

(i)  The  original  radiation  corresponds  to  universal  abundance,  the  difference 
arising  due  to  subsequent  interactions  before  reaching  the  earth. 

(ii)  The  composition  of  the  original  radiation  is  itself  quite  different  from  what 
could  be  expected  from  cosmic  abundance  and  the  subsequent  fragmentation  is 
of  such  a  nature  as  to  yield  a  spectrum  resembling  the  cosmic  abundance  with 
observed  deviations.  In  either  case  it  is  necessary  to  devise  a  fragmentation 
theory  which  is  of  course  more  important  in  the  latter. 

Adopting  the  second  point  of  view  and  assuming  that  the  primary  radiation 
oonsistsessentiallyofnuclei  of  metals  the  other  components  being  the  products  of 
fragmentation,  we  can  list  the  following  points  in  favour  of  it : 

(1)  It  would  explain  the  observed  abundances  of  Li,  Be  and  B, 

(2)  The  peak  in  the  abundance  at  iron  and  the  natural  preponderance  of  heavy 
nuclei  are  explained, 

(3)  There  may  be  injection  mechanisms  favouring  heavy  nuclei. 

In  this  connection  it  is  necessary  to  stress  the  importance  of  not  merely  the 
relative  abundance  of  various  nuclei  irrespective  of  their  energy  but  also  at 
various  energies  since  as  we  shall  see  later,  the  age  of  a  cosmic  ray  particle  is  closely 
connected  with  its  energy,  e.g.  in  the  Fermi  theory  of  cosmic  radiation  (which  will 
be  discussed  in  great  detail  in  a  later  chapter)  energy  increases  with  age.  Hence 
at  the  low  energy  end  of  the  spectrum  fewer  fragmentation  products  are  to  be 
expected  but  this  is  in  disagreement  with  the  currently  accepted  statement 
that  all  primary  components  have  similar  velocity  spectra.  This  implies  that  the 
mean  free  path  of  cosmic  rays  in  the  galaxy  is  shorter  than  the  mean  free  path  for 
nuclear  collision  and  has  led  to  a  revision  of  the  Fermi  theory  by  Rossi  ^  et  al. 


^  The  L :  M  ratio  quoted  by  Bristol  workers  is  1  :  1  while   a  different  ratio  has  been 
obtained  using  a  different  determination  of  fragmentation  probabilities  by  Gottstein  et  al. 

2  H.  C.  Urey  and  Suess,  Rev.  Mod.  Phys.  28,53(1956). 

3  B.  Rossi,  S.  Olbert  and  P.  Morrison,  Phys.  Rev.  94,  440  (1954). 

BPOR      24 
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A  clear  cut  experiment  demonstrating  the  absence  of  Li  etc.  above  the  atmosphere 
would  be  fatal  to  the  hypothesis  of  the  origin  of  the  primary  radiation  by  frag- 
mentation. 

EnernT}'  spectrum  and  spatial  distribution 

It  is  very  important  to  know  the  energy  and  spatial  distribution  and  the 
associated  quantities  like  the  intensity,  energy  density  and  number  density  of 
cosmic  ray  particles  of  various  types. 

The  energy  distribution  has  two  characteristic  features : 

(1)  a  low  energy  cut-oflF; 

(2)  a  power  law  spectrum,  i.e.  the  number  of  particles  of  type  j  (irrespective 
of  their  direction)  between  energy  E  and  E  +  dE  is  given  by 

/  E  \y-i  d^ 

on  the  assumption  that  no  particle  has  energy  below  Ep.  Nj  is  the  total  flux 
coming  from  all  directions.  The  number  of  particles  above  an  energy  E  is  given  by 

N 

lN^{E)dE  =  Nj(^J~\ 

Considerable  theoretical  significance  cannot  be  attached  to  a  power  law  spectrum 
especially  when  experimental  data  requires  y  to  be  chosen  suitably  for  different 
energy  ranges  and  for  different  types  of  particles.  However,  it  was  in  an  attempt 
to  explain  the  power  law  spectrum  that  Fermi  was  led  to  making  a  far  reaching 
hypothesis  regarding  the  origin  of  cosmic  radiation.  According  to  present  ex- 
perimental data  the  following  values^  for  y  are  found  suitable. 


Energy  range 
per  nucleon 

Value  of 

Y 

0.5  and  1  GeV 

~2 

0.35  and  10-20  GeV 

1.9  to  2.2 

5  X  1010  to  1012  ev 

-2.5 

1013  and  10i«  eV 

2.7  to  3 

above  2  GeV 

2.5 

The  most  striking  feature  of  the  spatial  distribution  is  the  near  isotropy  of 
cosmic  radiation,  a  feature  which  has  to  be  explained  in  any  theory  of  its  origin. 
Since  cosmic  rays  are  charged  particles  it  is  to  be  expected  that  their  motion 
should  be  affected  by  the  magnetic  field  of  the  earth.  The  Lorentz  force  acting 
on  charged  particles  in  a  magnetic  field  deflects  particles  which  would  otherwise 

1  V.  L.  GiNZBURG,  loc.  cit. 
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Fig.  49.  Dependence  of  cosmic  ray  intensity  on  magnetic  latitude. 
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Fig.  50.  Cosmic  ray  ionization  as  a  function  of  altitude  at  four  different  latitudes. 
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travel  in  straight  lines.  Such  deflections  should  naturally  affect  the  intensity 
distribution  with  latitude.  One  remarkable  feature  of  this  geomagnetic  effect  is 
that  certain  directions  of  arrival  are  forbidden  depending  on  the  latitude  of 
observation  and  the  momentum  of  the  particles  while  the  intensity  in  the  allowed 
directions  is  still  isotropic  and  it  is  only  due  to  the  fact  that  certain  directions 
are  forbidden  that  the  intensity  varies  with  latitude. 

The  variation  of  cosmic  ray  intensity  with  latitude  at  sea  level  was  first  re- 
ported by  Clay^  and  later  confirmed  by  Compton*  and  co-workers  and  the  vari- 
ation with  geomagnetic  latitude  was  better  correlated.  All  the  observations  in- 
dicated that  the  cosmic  ray  intensity  does  not  vary  very  much  above  a  geo- 
magnetic latitude  50°  N  or  S.  A  decided  seasonal  effect  was  also  noted  and  this  was 
believed  to  be  correlated  with  atmospheric  temperature. 

The  latitude  eflFect  is  completely  explained  as  will  be  shown  later  by  studjdng 
the  motion  of  a  charged  cosmic  ray  particle  in  the  magnetic  field  of  the  earth. 
The  graph  representing  the  variation  of  cosmic  ray  intensity  with  latitude  is 
given  in  Fig.  46. 

The  fact  that  the  intensity  does  not  vary  beyond  a  certain  latitude  implies 
that  there  is  a  low  energy  cut-oflF  and  that  for  particles  with  energy  greater  than 
this,  all  directions  are  allowed  above  a  particular  latitude.  This  has  been  discussed 
in  the  chapter  on  geomagnetic  effects. 

2.  TIME  VARIATION   OF   COSMIC   RAYS' 

The  subject  of  time  variations  of  cosmic  rays  has  become  important  in  recent 
years  with  the  increasing  realization  that  it  will  give  some  clue  to  the  origin  and 
production  of  cosmic  radiation,  its  mode  of  propagation  and  the  influence  of  the 
electromagnetic  fields  in  interplanetary  and  perhaps  interstellar  space.  It  is 
reasonable  to  expect  that  all  the  known  time  variations  of  low  energy  cosmic 
rays  are  mostly  due  to  events  in  the  solar  system. 

The  solar  daily  variation  of  cosmic  ray  intensity  exhibits  a  number  of  features 
which  are  clearly  of  non-meteorological  origin.  Moreover  many  of  these  relate 
to  radiation  outside  the  influence  of  the  geomagnetic  field  and  provide  information 
concerning  the  anisotropy  of  primary  radiation.  The  anisotropy  is  of  a  variable 
character  undergoing  change  from  day  to  day  and  its  twelve-month  average 
characteristics  also  change  markedly  from  year  to  year  with  striking  relationships 
with  the  eleven  and  twenty- two  year  cycles  of  solar  activity. 

Because  of  the  obvious  difficulty  of  making  continued  observations  at  high 
altitudes  where  the  nucleonic  component  predominates,  most  of  the  observations 
are  done  at  sea  level  or  below  the  earth  where  the  cosmic  radiation  consists  mostly 
of  the  mesonic  component.  It  has  been  found  that  meteorological  factors  have 

1  J.  Clay,  Proc.  Roy.  Acad.,  Armt.  30,  1115  (1927). 

2  A.  H.  CoMPTON  and  R.  N.  Turner,  Phys.  Rev.  52,  799  (1937). 

3  S.  F.  Singer,  loc.  cit.  See  also  V.  Sarabhai  and  N.  W.  Nerurkar,  Ann.  Rev.  Nud. 
Sci.  6,  1  (1956). 
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less  effect  on  the  day  to  day  fluctuations  of  the  nucleonic  component  than  on 
the  mesonic  component.  When  all  the  local  atmospheric  effects  have  been  taken 
into  account,  some  effects  remain  which  seem  to  be  truly  characteristic  of  the 
primary  radiation.  For  instance,  the  daily  variations  of  the  nucleonic  component 
measured  at  different  latitudes  and  averaged  over  a  large  number  of  days  have 
a  peak  to  peak  amplitude  of  about  1  per  cent  which  is  comparable  to  the  ampli- 
tudes obtained  at  sea  level,  of  mesonic  intensities. 

The  time  variations  of  cosmic  rays  can  be  classified  as  follows : 

The  diurnal  variation 

This  is  due  to  the  observation  from  the  rotating  earth  of  anisotropic  radiation. 
The  presence  of  this  anisotropy  is  not  clearly  understood  and  in  fact  its  effect  is 
quite  small. 

The  diurnal  variation  of  the  secondary  component  measured  at  sea  level  and 
at  mountain  altitudes  by  means  of  ionization  chambers  and  counters  is  less  than 
1  per  cent,  the  value  depending  on  the  angular  aperture  of  the  telescope,  while 
that  of  the  neutron  component  is  perhaps  three  times  larger.  The  diurnal  variation 
seems  to  be  energy  dependent,  repeats  itself  for  a  few  days,  but  may  then  die  out 
and  disappear.  It  seems  to  be  enhanced  during  periods  of  magnetic  activity  and 
shows  a  27-day  recurrence.  The  phase  of  maximum  of  the  variation  seems  to  vary 
in  a  regular  manner  during  the  solar  cycle. 

Alfven  has  used  these  data  to  put  forth  a  mechanism  for  acceleration  of  cosmic 
rays  within  the  solar  system  by  means  of  magnetic  field  variations.^  It  is  first 
necessary  to  translate  from  the  maximum  phase  of  the  diurnal  variation  to  the 
direction  of  the  anisotropy  outside  of  the  earth's  magnetic  field.  He  interprets 
the  resultant  anisotropy  as  due  to  an  outflow  or  inflow  of  cosmic  rays  across  the 
earth's  orbit  coupled  with  a  tangential  component  due  to  the  earth's  motion; 
the  predominance  of  the  outflow  indicates  the  production  of  cosmic  rays  near 
the  sun.  A  model  has  been  suggested  in  which  the  electric  field  of  a  solar  corpus- 
cular beam  modulates  the  cosmic  ray  intensity.  Another  possibility  also  exists, 
viz.  the  field  of  the  rotating  earth  interacting  with  the  streaming  interplanetary 
gas  sets  up  electric  fields  which  modulate  cosmic  ray  intensity  over  a  restricted 
range  of  azimuthal  angles. 

Variations  at  the  source  of  cosmic  radiation 

These  may  be  sudden  spectacular  increases  as  are  observed  in  synchronization 
with  solar  flares  or  variations  which  show  a  trend  throughout  the  sun  spot  cycle 
which  one  might  call  secular  variation.  These  may  indicate  variations  in  the  output 
of  cosmic  rays  which  fall  into  two  classes  of  solar  origin. 

(i)  Short  term  variation  due  to  solar  flares:  Five  major  solar  flare  effects  on 
cosmic  rays  have  been  observed  before   1956. 

1  H.  Alfven,  Tellus,  6,  232  (1954). 
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(1)  28th  February  1942,  (2)  7th  March  1942,  (3)  25th  July  1946,  (4)  19th  No- 
vember 1949  and  (5)  23rd  February  1956. 

On  the  first  four  occasions  no  increase  has  been  observed  at  Huancayo  near  the 
geomagnetic  equator.  Therefore  the  primary  particles  responsible  for  these  increases 
were  thought  to  be  charged  particles  with  energy  <  15  BeV,  which  is  the 
threshold  energy  below  which  protons  cannot  penetrate  the  earth's  magnetic 
field  at  the  equator.  The  last  event  was  the  largest  and  the  produced  particles 
were  detected  at  the  geomagnetic  equator.  Observations  on  the  1949  flare  showed 
a  5  to  6-fold  increase  of  neutron  intensity  as  compared  to  a  12  per  cent  increase 
in  the  ionizing  component  at  the  same  station.  The  large  relative  increase  in 
neutron  intensity  is  expected  since  primary  protons  of  energy  <  10  BeV  are 
much  more  efficient  at  producing  neutron  rather  than  meson  secondaries. 

During  the  1956  flare,  a  large  increase  in  intensity  occurred  simultaneously 
beyond  25°  latitude.  The  nucleonic  component  predominated  over  the  mesonic 
and  the  radiation  was  in  general  more  energy  dependent  than  the  normal  cosmic 
ray  intensity.  There  was  an  average  increase  of  5.7  per  cent  near  the  magnetic 
equator  in  India  which  may  be  ascribed  to  a  flare  of  50  BeV  protons  coming 
direct  from  the  sun.  As  regards  the  dependence  on  energy  the  components  below 
10  BeV  undergo  greater  variations  than  those  above  it.  In  general,  the  energy 
spectrum  of  the  primaries  associated  with  solar  flares  is  much  steeper  than  that 
of  the  normal  primaries. 

It  seems  likely  that  the  primary  particles  responsible  for  these  increases  are 
accelerated  on  or  near  the  sun  by  some  mechanism  associated  with  a  flare, 
although  there  is  a  definite  time  delay  between  the  two  phenomena.  Maximum 
cosmic  ray  intensity  seems  to  occur  later  than  maximum  flare  intensity  by  up  to 
an  hour  and  the  cosmic  ray  intensity  remains  above  average  several  hours  after 
the  flare  has  disappeared. 

Geomagnetic  analysis,  assuming  that  these  particles  come  from  the  sun,  ex- 
plains very  well  the  extreme  longitude  dependence  of  these  increases.  From  this 
analysis  an  upper  limit  of  .^  5  x  10^^  gauss  cm^  has  been  derived  for  the  solar 
dipole  moment.  It  is  significant  to  observe  that  the  rate  of  decay  of  intensity 
after  the  maximum  of  the  solar  flare  increase  has  been  reached  depends  greatly 
on  the  location  of  the  observer.  It  is  therefore  thought  that  such  changes  in  polar 
regions  are  due  to  particles  which  do  not  come  directly  from  the  sun  but  have 
been  redirected  and  temporarily  stored  by  magnetic  fields. 

Various  models  for  such  a  magnetic  field  have  been  discussed.  Attempts  have 
been  made  to  explain  the  polar  increases  in  terms  of  defiections  by  a  galactic 
field.  It  has  been  shown  that  both  the  geographic  distribution  and  the  intensity 
decay  could  be  explained  by  assuming  the  solar  flare  particles  to  progress  along 
a  weak  interstellar  field  with  all  possible  pitch  angles.  The  reflection  of  the  particles 
from  a  heliocentric  boundary  of  radius  of  about  1 .2  astronomical  units  has  been 
also  considered  as  contributing  to  the  temporal  variations. 

(ii)  Long  term  variation  due  to  suTispot  effects:  The  long  term  variations  of 
cosmic  ray  intensity  are  associated  with  sun  spot  activity  and  continuous 
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observations  extend  back  only  to  1937.  The  general  intensity  is  directly  related 
to  the  number  of  sun  spots,  the  change  in  the  twelve  month  mean  intensity 
being  of  the  order  of  4  per  cent  between  the  periods  of  sun  spot  maximum  and 
minimum.  The  change  was  the  same  both  on  magnetically  quiet  as  well  as  on 
disturbed  days.  It  was  found  that  there  was  a  50  per  cent  increase  in  ionization 
at  70,000  ft  for  the  sun  spot  minimum  year  1954  as  compared  to  the  sun  spot 
maximum  year  1937.  These  changes  were  much  higher  at  higher  altitudes.  The 
intensity  in  the  upper  atmosphere  in  1940  was  found  to  be  much  greater  than  in  1954. 

There  were  large  changes  of  the  twelve  month  average  daily  variation  of  the 
meson  intensity  between  the  years  1953  and  1956.  These  can  be  related  to  either  the 
eleven  or  twenty-two  year  cycle  of  sun  spot  activity.  The  correlation  was  best 
during  the  sun  spot  minimum  indicating  that  the  variations  are  related  to  the 
solar  activity  in  the  equatorial  belt  of  the  sun.  In  fact  it  was  later  found  to  be 
correlated  with  the  activity  of  the  5303  A  coronal  emission. 

Associated  with  the  solar  cycle  intensity  variation  is  the  phase  shift  of  the 
diurnal  variation.  The  shift  of  the  time  of  maximum  diurnal  intensity  was  found 
to  be  independent  both  of  the  direction  of  arrival  of  the  particle  and  the  geo- 
magnetic disturbance  of  the  day. 

At  Huancayo  the  twelve  month  mean  daily  variation  exhibited  a  maximum 
at  noon  from  1937  to  1943  and  a  maximum  at  early  morning  in  1952.  In 
the  intervening  period  there  was  a  progressive  increase  of  the  early  morning 
maximum  and  decrease  of  the  noon  maximum.  During  1945-50,  the  daily  variation 
exhibited  two  maxima.  Only  half  the  cycle  of  changes  are  completed  in  eleven 
years,  indicating  a  twenty- two  year  cycle. 

Modulation  effects 

These  variations  are  brought  about  by  electromagnetic  conditions  of  short 
duration.  These  may  either  be  sudden  decreases  (For bush  decreases)  of  cosmic  ray 
intensity  often  associated  with  world  wide  magnetic  storms  or  variations  which 
show  a  twenty- seven  day  recurrence  relation  similar  to  that  of  the  small  magnetic 
storms  and  auroral  displays  and  hence  related  to  the  rotational  period  of  the  sun. 

The  great  world-wide  storms  have  a  fairly  well  defined  pattern  characterized 
by  a  sudden  increase  in  the  horizontal  field  intensity  (.5-1  per  cent)  which  lasts 
for  an  hour  or  so  followed  by  a  slow  recovery  over  a  period  of  a  days. 

The  Forbush  type  of  decreases  are  quite  striking  and  in  some  cases  are  as  large 
as  10  per  cent.  While  they  are  often  associated  with  magnetic  storms  there  are 
many  examples  where  the  decreases  occur  without  storms  and  conversely  storms 
without  decreases.  This  suggests  that  the  correlation  between  the  two  is  not  an 
immediate  one  but  goes  back  to  a  common  cause.  Alfven^  (1946)  first  suggested 
that  the  emission  of  corpuscular  matter  from  the  sun  which  is  responsible  for 
magnetic  storms  is  also  responsible  for  the  cosmic  ray  decrease.  In  Morrison's^ 


1  H.  Alfven,  Nature,  Lond.  158,  618  (1946). 

2  P.  Morrison,  Phys.Rev.  101,  1397  (1956). 


376  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

model  clouds  of  ionized  matter  containing  turbulent  magnetic  fields  but  free 
from  cosmic  radiation  are  being  continually  emitted  from  the  active  solar  regions. 
The  cosmic  rays  gradually  diffuse  into  these  clouds  and  when  the  earth  is  sur- 
rounded by  such  a  cloud,  decreases  in  cosmic  ray  intensity  are  observed  due  to 
scattering  of  the  primary  particles. 

Dorman^  considers  an  ionized  solar  matter  beam  to  carry  a  frozen  magnetic 
field  derived  either  from  the  dipole  field  of  the  sim  or  from  the  local  field  at  its 
origin.  The  trapped  field  gets  progressively  attenuated  with  increasing  distance 
from  the  sun  and  can  also  get  weakened  and  randomized  due  to  the  onset  of  tur- 
bulence at  a  certain  stage.  Changes  of  cosmic  ray  intensity  and  isotropy  could 
occur  (1)  because  of  the  electric  polarization  of  the  beam  and  (2)  due  to  magnetic 
deflection  of  low  energy  primaries.  Modulation  of  intensity  alone  indicates  presence 
of  beams  largely  randomized  through  turbulence. 

A  systematic  analysis  of  cosmic  ray  data  along  with  the  geomagnetic  disturb- 
ances has  been  presented  recently  by  Sarabhai  et  al}  who  observe :  "At  the  present 
moment  no  detailed  verification  of  the  modulation  theories  involving  cosmic  ray 
beams  could  be  considered  to  have  been  achieved ;  it  appears  that  beams  certainly 
play  an  important  role  but  it  is  more  than  likely  that  we  have  other  processes 
as  well". 

Prehistoric  intensity  variations 

It  has  been  estimated  using  deep  core  samples  that  the  cosmic  ray  flux  has  not 
varied  by  more  than  10-20  per  cent  over  the  last  35,000  years;  other  methods 
based  on  meteorites  or  nuclear  disintegrations  in  the  atmosphere  have  been 
suggested  to  allow  even  longer  extrapolation  or  to  cross-check  the  existing 
analysis. 


1  DoRMAN,  J.  Expt.  Theoret.  Phys.  26,  337  (1953). 

2  V.  Sababhai  and  Satyaprakash,  Proc.  Ind.  Acad.  Sci.  51,  84  (1960). 


CHAPTER  XI 

GEOMAGNETIC  EFFECTS 

Since  primary  cosmic  radiation  consists  essentially  of  charged  particles  it 
is  to  be  expected  that  the  magnetic  field  of  the  earth's  dipole  should  influence 
the  direction  and  intensity  of  the  radiation  through  the  deflection  of  the  particles 
as  they  approach  the  earth.  We  begin  the  study  with  the  general  mathematical 
theory  of  the  motion  of  a  charged  particle  in  the  field  of  a  magnetic  dipole  and 
then  deal  with  the  particular  considerations  of  our  problem,  i.e.  the  relation 
between  the  geomagnetic  field  and  the  intensity  of  cosmic  radiation  at  the  surface 
of  the  earth. 

1.  MOTION  OF  A  CHARGED  PARTICLE  IN  THE  FIELD 
OF  A  MAGNETIC  DIPOLE^ 

Equations  of  motion 

The  theory  of  the  motion  of  a  charged  particle  in  a  magnetic  field  with  special 
reference  to  that  of  the  earth's  dipole  was  first  developed  in  a  series  of  papers 
by  Stormer  and  later  extended  by  Lemaitre,  Vallarta^  and  their  coUaborators. 

The  equation  of  motion  of  a  particle  of  charge  e  in  an  electromagnetic  field 
with  electric  and  magnetic  components  M  and  H  respectively  is 

^{mr)^e[E+~rxH\  (1) 

where  m  is  the  relativistic  mass,  c  the  velocity  of  light  and  r  the  position  vector 
of  the  particle.  In  a  pure  magnetic  field,  E  being  zero,  the  force  on  the  particle 
is  always  perpendicular  to  the  direction  of  motion ;  the  energy  and  the  absolute 
values  of  the  momentum  and  velocity  and  consequently  the  relativistic  mass  m 
are  constants.  Thus  we  can  write 

We  shall  now  assume  that  the  centre  of  the  dipole  is  situated  at  the  origin 


^  Parts  of  this  chapter  are  based  upon  the  treatment  of  the  subject  found  in  Cosmic  Eays^ 
L.  Janossy,  Oxford. 

2  C.  Stormer,  Z.  Astrophys.  1,  237  (1938).  G.  Lemaitre  and  M.  S.  Vallarta,  Phys.  Rev. 
43,  87  (1933). 
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and  its  moment  M  is  along  the  z-axis.  The  field  at  a  point  r  with  Cartesian  compo- 
nents X,  y,  z  is  given  by 

H=-grad-^.  (3) 

Since  the  velocity  and  mass  are  constants,  Stormer  simplified  the  equations  of 
motion  by  introducing  a  new  independent  variable  s  instead  of  t  where  s  =  vt. 
Thus,  if  velocities  are  measured  with  respect  to  s,  all  particles  have  the  magnitudes 

of  their  "new"  velocities  v,  equal  to  unity. ^  Further  we  normalize  I ]  ^  which 

has  the  dimension  of  length  to  unity  by  a  proper  choice  of  the  unit  of  length  and  v 
is  measured  in  terms  of  these  units.  The  equations  of  motion  in  Cartesian  co- 
ordinates thus  reduce  to 

A^x  1 

d^i/  1 

-^  =  -  ^[(3^^  -  r^)  {x'  -Zxzz')-]  (4) 

d^z  1 

=  ---j{Zxzy'  -^zyx'-\ 


&8^  fS 

where 

dx  dy  dz 

X  =  — — ,  y  =  — p-  and     z  =  -r — . 

ds  ^        ds  ds 

We  shall  now  show  that  when  the  equations  are  expressed  in  cylindrical  instead 
of  Cartesian  coordinates,  the  study  of  the  peculiar  features  of  the  dynamical 
problem  is  .made  easier. 

If  2,  Q,  (p  are  the  cylindrical  coordinates 

^2  _  ^2  _  ^2;  tan  (p  =  ylx.  (5) 

After  simple  calculation  we  obtain 

e^^  =  2g-Q^I,^  (6) 

i  •"'-IT  <'—'  =  '-(¥-*)'  =  « 

where  2gr  is  the  constant  of  integration  obtained  in  deriving  (6).  Its  interpretation 

will  be  given  presently.  Since  equation  (1)  involves  only  r,  g  and  z,  it  represents 

the  motion  in  the  meridional  plane  (o,  z  plane)  and  can  be  studied  without  reference 

to  (p.  The  angular  velocity  of  the  meridional  plane  about  the  z  axis  (the  axis 

dw 
of  the  dipole),  is  given  by -5 —  which  is  a  function  of  g  and  z. 

1  v  is  a  number  denoting  the  magnitude  of  the  velocity.  Hence  8  has  the  dimension  of  time 
but  is  numerically  equal  to  the  length  of  unity  described  in  time  t.  Though  the  newly  defined 
velocity  is  unity  for  all  v,  any  casual  reference  to  velocity  and  momentum  of  the  particle  would 
mean  that  they  refer  to  the  case  when  the  independent  variable  is  t. 
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Since  l^^p  is  unity,  J  |^«P  =  ^U  (this  is  proportional  to  the  kinetic  energy 
for  non-relativistic  particles)  and  this  term  can  be  resolved  into  two  parts — 
one  corresponding  to  the  motion  in  the  meridional  plane  (^/2)  and  the  other  to 
the  motion  perpendicular  to  the  meridional  plane  equal  to 

i{Q9^r  =  'l2-QI2  (8) 

the  prime  denoting  differentiation  with  respect  to  s.  This  quantity  can  be  fictitiously 
reckoned  to  be  the  potential  of  the  particle  when  we  are  considering  the  motion 

only  in  the  meridional  plane.  Janossy  calls  — —  the  "potential"  but  it  is  more 

logical  to  treat  (8)  as  the  "fictitious"  potential  for  the  obvious  reason  that  in  the 

non-relativistic  case  the  sum  of  the  "fictitious  potential"  and  the  kinetic  energy 

in  the  meridional  plane  should  give  the  total  kinetic  energy  of  the  particles  the 

actual  potential  energy  of  which  in  the  field  of  the  dipole  is  zero.  Two  equations 

Q 
immediately  follow  on  recognizing  —  to  be  J  {q'  ^  -^  z'^)  in  the  meridional  plane : 

Q    -^-^,         z    =4^.  (9) 

We  shall  now  interpret  2g  in  physical  terms. ^  q^  -~-  =  Q^(p  is  proportional  to 

the  kinetic  angular  momentum  of  the  particle.  According  to  relativistic  electro- 
dynamics, when  the  vector  potential  is  not  zero  in  a  magnetic  field,  the  total 
angular  momentum  is  to  be  distinguished  from  the  kinetic  angular  momentum. 

o  7" 

If  L  is  the  Lagrangian  the  particle,  -^-r-  is  the  total  angular  momentum  and 

or  J  0(p 

-~r  =  -T-  [A  •  v]  where  A  is  the  vector  potential  and  v  the  velocity.  The  coefficient 

of  ^  •  i;  is  actually which  has  been  normalized  to  unity.  As  r  ->-  oo  the  total 

angular  momentum  tends  to  be  essentially  kinetic.  It  is  the  total  momentum  2g 
that  is  conserved  during  the  motion.  If  A  is  the  latitude  and  ca  ,  e^ ,  e^  are  three 
mutually  perpendicular  unit  vectors 

e^  =  e^xe,.  (10) 

V  =  f  ej.-\- r  Xe;^-\- r(pGos?.e^.  (11) 

The  vector  potential  of  the  magnetic  field  of  the  dipole  is 


Mx  F— 

r 


MKx  V'~ 
r 


=  M^e.  (12) 


^^^^^^^^M(cosAe^-2^nAe^^  (13, 


^  This  is  based  on  the  discussion  by  Fermi,  Nuclear  Physics,  Chicago  University  Press. 


(14) 
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The  relativistic  Lagrangian  of  the  particle  is  given  by 

Jj=  —  wIq c*)/l  —  )?*  H A*v  where  /5  =  v\c 

c 

=  -moC*j/l-/?»  +  -^ —<p. 

d    dL       dL 

dL  dL 

and  L  is  not  an  explicit  function  of  99,  — —  =  0  or  — -  is  a  constant  of  motion 

eM  cos^A 
c         r 

(le) 


1r*cos*A  eM  cos^A 

— ;; — ^  +  "7^~~*r- 


If  we  define  (o  as  the  angle  which  the  tangent  to  the  trajectory  makes  with  the 

east-west  direction,  then 

Vg>       rcosX  .-„. 

cosca  =  — '^  = w  .  (17) 

V  V 

Hence  ,gjl^  .  cos^A       . 

^^^=  rcos/coso)  +    ==2  or.  (18) 

p  \mvcJ      r 

Thus  we  have  identified  2gr  to  be  proportional  to  the  total  angular  momentum 
associated  with  the  particle. 

2.  STORMER  AND  MAIN  CONES 

We  write  the  equation  for  cos  co  as 

2g  cos  A  ... 

cosa>  = J r— .  (1) 

rcos/  r^ 

I  e  M  \V2 
Using  Stormer  units  of  length,  i.e.  setting  I I     =  1  since  r  is  measured  in  terms 

of  Stormer  units,  the  radius  of  the  earth  /•o(^)  in  these  units  depends  upon  the 
energy  of  the  particle.  Thus  in  studjdng  the  trajectory  of  a  particle  of  momentum  p 
the  radius  of  the  earth  is  assigned  a  value  rQ(p)} 

Since  cosw  can  lie  only  between  — 1  and  +1  it  is  clear  that  (1)  imposes  re- 
strictions upon  the  values  of  g  and  r.  The  condition  on  cos  co  implies  the  non- 
negative  nature  of  the  kinetic  energy  Q  since 

^  =  (1  —  cos  ^w) 
so  that  the  two  hyperbolae 

coso)  =  +1  1  2q  cos  A 

cosw  =  —  1  f       rcos  A  r^ 


^  See  for  example,  Meixner's  lucid  treatment  in  "  Cosmic  Radiation",  edited  by  W.  Heisen- 
BERG  (1946),  Dover. 
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for  a  given  A  divide  the  r-g  plane  into  regions  of  positive  and  negative  kinetic 
energy,  the  latter  being  forbidden. 

A  trajectory  of  a  particle  in  the  meridional  plane  is  characterized  by  the 
co-ordinates  r  and  X  corresponding  to  a  given  g.  In  the  r-g  plane  it  is  represented 
by  a  line  parallel  to  the  r-axis  since  g^  is  a  constant  of  motion. 

Fig.  51  shows  the  curves  corresponding  to 

cos  CO  =  ±  1 

in  the  r-g  plane  for  a  particular  X.  The  following  geometrical  features  are  im- 
mediatety  evident : 
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Fig.  51.  The  curves  (I)  cosco  =  +  1,  (H)  cosco  =  —  1  and  dP/dx  =  0  for  a  given  1. 
SoHd  curves  refer  to  A  =  60° ;  broken  ones  for  A  =  0. 

(i)  g  =:  gj^=  cos'/^A  is  a  line  tangential  to  the  curve  /  at  r  =  cos'/^A  and  cuts 
the  curve  II  at  r  =  rx=  {]/2  —  1)  cos  '^^A.  As  X  increases  the  critical  values  of 
gx  and  r;^  decrease  so  that  the  curve  /  advances  towards  the  origin.  The  maximum 
of  this  critical  value  of  g^  as  A  varies  is  unity  and  occurs  for  X  =  0. 

(ii)  Any  line  g  =  constant  <  cos'^^A  cuts  only  the  curve  //  and  in  particular 
^  =  0  cuts  the  curve  //,  at  r  =  cos'^"  X.  The  maximum  of  this  r  as  A  varies  is  unity 
a,nd  occurs  for  X=  0. 

(iii)  For  a  given  r  as  cos  co  varies  from  —  1  to  +  1,  2g  varies  from  2g  =  — rcosA 
+  (cos^A)/r  to  2g  =  rcosX  +  (cos^A)/r.  However  we  shall  now  show  that  this 
entire  range  of  g  values  is  not  permissible  if  we  impose  certain  conditions  on  the 
trajectories. 

Now  in  order  that  a  trajectory  (represented  by  a  line  parallel  to  the  r-axis) 
may  reach  an  observation  point  on  the  surface  of  the  earth  denoted  by  {rQ{p),  Xq) 
it  must  assume  values  from  +  oo  up  to  rQ{p).  In  case  it  were  to  assume  values 
less  than  rQ{p)  it  will  lie  within  the  earth  and  will  be  forbidden  due  to  the  "shadow 
effect"  of  the  earth.  Ignoring  this  for  the  moment  we  shall  consider  a  trajectory 
of  a  particle  with  momentum  p,  to  be  absolutely  forbidden  if  any  part  of  it 
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(r  >  ro(p))  lies  in  the  forbidden  region  for  any  A.  Since  we  do  not  restrict  the  A 
values  of  the  trajectory  we  require 

9^^  (3) 

to  ensure  that  the  trajectory  of  a  particle  with  momentum  p  such  that  Tq  (p)  <  1 

cuts  the  forbidden  region  whatever  be  the  value  of  A.  Hence  cosco  varies  from  — 1 

to  a  value  -,  , 

,1       \  2  cosAo 

cos  CO.  (Ao ,  ??)  =  --—-2 zr-  W 

foCOsAo  rj 

and  this  corresponds  to  a  cone  called  the  Stormer  cone  and  all  directions  with  co 
between  0  and  m,  are  absolutely  forbidden.  In  terms  of  the  momentum  p  of  the 

P*^''='*  ,,        2_  008^ 

/  e  M  VI* 

which  may  be  obtained  by  using  ordinary  units  of  length,  i.e.  ^^(cm)  =  I I     r 

eMlc  \  pc  / 

and  p^  =  — ^  •  Alternately,  particles  with  momentum  p  <  Pq  will  be  forbidden 

where  ^  212 

_  cos'* A  COSTCO 

^'^^"[l  +  )/l-cos3AcoscaJ2    •  ^^^ 

In  defining  the  above  forbidden  directions  we  require  g^  ^  1  since  there  is  no 
restriction  on  the  A  value  of  the  trajectory.  However  trajectories  with  A  ^  Aq 
corresponding  to  particles  with  momentum  p  such  that  Tq  {p)  <  cos*/*  Aq  are  for- 
bidden if  g'  >  cos'/'  Aq  ,  since  such  trajectories  always  pass  through  the  forbidden 
region.  Thus  g^  =  cos'^'Aq  determines  the  critical  value  codrQ,  Aq)  given  by 

2cos'/«Ao       cosAo  ._. 

cos  (Oc  (Ao  ,  ro)  = ^  (7) 

and  particles  with  momentum  p  and  Tq  (p)  <  eos'/«  Aq  can  approach  the  observation 
point  {^-oTq)  at  an  angle  co  such  that 

—  1  <  cos  CO  <  COSCOc(Ao?*o). 

In  terms  of  the  momentum  of  the  particle 

2COSV2A0        cosAo  ,„, 

cos  ojc  Ao  ^0  =    ,   /    71/ ,   ,    X  .  8 

Another  way  of  representing  this  relation  is  to  ask  for  the  minimum  momentum  Pq 
which  a  particle  should  have  if  it  were  to  approach  the  point  at  an  angle  co. 
This  is  given  by  „ 

^«(^-  ^^  =  cosAO(ni-cosco)J+P'  ^'^ 

Summarizing  we  have 

(i)  For  particles  with  momentum  p  where 

p>  Pu  cosAq,    i.e.    Vq  >  cos'/*Ao 

all  directions  are  allowed  for  trajectories  with  ^  ^  }.q.  For  the  equatorial  plane 
Ao=0. 
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(ii)  If  r^  <  (|/2  —  1)  Gos'i'  Aq  all  directions  will  be  forbidden  for  trajectories 
with  A  ^  Aq  .  Therefore  particles  with  momentum  p  where 

p  <  pP'  cosAo  =  Puip  -  1)2  cosAo 

are  forbidden  in  all  directions  for  A  ^  Aq.  For  the  equatorial  plane  the  critical 
momentum  is  p\^\ 

(iii)  For  particles  with  momentum  p  such  that 

(J/2-1)  cos^^  Ao  <  r^ip)  <  cosV^Ao 

all  trajectories  with  2.  ^  ?.q  will  be  allowed  for  g  ^  cos'/«  Aq  •   The  critical  angle 

ft;'(Aoro)is  o       v  :  i 

,.      ,       2  cos /"An        cos  An 
cos  CO,  (Ao  Tq)  = ■ ^ -y- 

and  all  directions  as  cosco  varies  from  —1  to  the  above  value  will  be  allowed 
The  cone  with  co  varying  from  ji;  to  co,  (i.e.  n  —  co^)  is  called  the  main  cone  and  within 
the  angle  tt  —  w,  all  directions  are  allowed  for  X'^^Xq.  In  terms  of  momentum  p 

all  particles  with  momentum  p>p{l,  co)  -= .—— — r- can  approach 

{r,,X,)  at  an  angle  ca.  ^^^^^l^tl  "  ^^  +  1)' 

3.  DEVIATIONS  FROM  THE  STORMER  AND  MAIN  CONES^ 

We  have  taken  the  trajectory  of  a  particle  to  be  forbidden  if  the  kinetic  energy 
at  any  point  on  the  trajectory  is  negative,  this  being  the  only  condition  which 
has  till  now  been  imposed.  The  allowed  directions  were  defined  as  those  com- 
plementary to  the  forbidden.  There  are  two  other  conditions  which  have  to  be 
satisfied  for  a  direction  to  be  allowed. 

(1)  Since  the  equations  of  motion  cannot  be  explicitly  solved,  we  have  to 
ascertain  whether  the  trajectory  of  a  particle  coming  in  the  allowed  direction 
can  be  traced  to  infinity.  This  was  first  realized  by  Stormer  who  demonstrated 
the  existence  of  periodic  orbits  which  are  confined  to  some  bounded  space. 
A  rigorous  definition  of  periodic  orbits  will  be  given  presently.  Obviously  if  we 
are  not  able  to  trace  the  trajectory  in  any  "allowed"  direction  to  infinity,  then 
that  direction  is  in  fact  forbidden. 

(2)  Even  if  we  are  able  to  trace  orbits  to  infinity  it  will  be  shown  that  in  some 
cases  certain  directions  are  still  forbidden.  If  a  trajectory  is  such  that  before 
reaching  (r^ ,  Aq)  it  assumes  values  of  r  <  ro  it  implies  that  it  intersects  the  earth 
before  reaching  the  observation  point  and  wiU  hence  be  forbidden  by  the  shadow 
effect  of  the  earth.  If  these  two  additional  conditions  are  imposed,  it  is  clear 
that  the  Stormer  and  the  main  cones  previously  defined  provide  an  inadequate 
description  of  the  allowed  regions.  We  shall  now  consider  the  shadow  effect  of 
the  earth. 


1  T.  K.  Radha  and  R.  Thunga,  Report  of  the  Cosmic  Ray  Symposium   (1960)  D.A.E. 
Govt,  of  India. 
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To  consider  these  refinements  it  is  necessary  to  introduce  a  more  convenient 
set  of  coordinates  for  the  description  of  allowed  and  forbidden  regions  in  the 
meridional  plane.  The  main  defect  of  the  polar  representation  r-A  is  that  the 
regions  for  small  r  are  contracted  as  compared  with  those  for  large  r  —  a  defect 
inherent  in  any  polar  representation.  It  is  therefore  more  convenient  to  plot  the 
trajectory  with  A  as  a  Cartesian  coordinate  and  define  the  variable  x  as 

e*=2grr.  (1) 

This  is  known  as  the  Goursat  transformation  and  x  is  a  measure  of  the  energy.^ 
In  addition  we  define  a  new  independent  variable  a  as 

e2«d<T=  (2g)^d8.  (2) 

A  simple  meaning  can  be  given  to  this  transformation  of  the  independent 
variable.  If  we  imagine  the  motion  of  a  particle  in  a  fixed  meridional  plane  (this 
is  not  so  in  the  present  case),  the  velocity  about  the  centre  of  the  dipole  in  the 
two  representations  are  equal. 

Applying  these  transformations,  we  have 


do^  ~^   dx'  da^~^   dX  ^  ^ 


where 


P{x,X)  =  Qe^^-(e-^cosX ^^-)';  a  =  {2g) 

\  cos  A  I 


-4 


(4) 


and         cos  cu  =  lb  r  cos  / -r-^  = -. —  =  smc/=    1 e  ^^]    .         (5) 

as        rcosA  r^  \         a  / 

These  equations  which  are  equivalent  to  (I  -9)  are  not  integrable  in  a  closed 
form,  and  for  their  treatment  there  has  been  recourse  to  general  theorems, 
series  expansions  and  numerical  methods. 

We  shall  here  make  some  general  remarks  upon  types  of  orbits.  The  boundary 
lines  between  the  region  of  positive  and  negative  energy  are  given  in  the  Goursat 
representation  by  those  corresponding  to  P  =  0.  In  the  Goursat  representation, 

the  region  corresponding  to  ^  >  0  is  that  in  which  P  >  0  ( since  P  =    .    ^  Q  j . 

We  shall  first  consider  the  series  of  curves  corresponding  to  P  =  0  in  the  x, 
X  plane  for  various  values  of  g.  This  curve  for  a  given  g  consists  of  two  distinct 
unconnected  parts  A  and  B  corresponding  to  the  pair  of  curves  cosco  =  +  1 
and  — 1  in  the  r-g  plane.  The  diagram  (Fig.  52)  is  a  two-dimensional  (ic,  A) 
representation  of  a  three-dimensional  surface  with  g  as  the  third  dimension.  The 
characteristics  of  the  curves  in  the  r-g  plane  discussed  before  may  be  identified 
from  this  diagram  as  follows : 


1  E.  J.  ScHREMP,  Phys.  Rev.  54,  153,  158  (1937);  Phys.  Rev.  57,  1061  (1940);  C.Stormer, 
loc.  cit.;  G.  Lemaitre  and  M.  S.  Vallarta,  Phys.  Rev.  49,  719  (1936);  50,  493  (1936).  R.  A. 
Hunter,  Phys.  Rev.  55, 15, 107,  614  (1939);  A.  Bangs,  Math.  P%5. 18, 21 1(1939)  ;L.Bonckan- 
RET,  A7in.  Soc.  Sci.  Bruxelles  A  54,  174,  (1937). 
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(1)  The  lower  set  of  curves  (A)  correspond  to  the  curve  cos  co  =  — 1  in  the  r-g 
plane  and  for  a  given  g,  the  higher  the  latitude,  the  smaller  the  x  and  correspond- 
ingly their  r  values.  This  represents  exactly  the  lowering  of  the  curve  cosco  =  — 1 
in  the  r-g  plane  as  A  increases.  The  curve  A  shifts  to  the  left  as  g  decreases,  i.e. 
for  a  given  A  smq-ller  the  g  value  smaller  the  x  value  or  larger  the  r  value. 
It  should  be  noted  that  while  for  a  given  g,  r  increases  as  x  increases,  if  we  were 
to  decrease  g,  r  increases  as  x  decreases.  This  is  as  it  should  be  which  can  be  seen 
from  the  curve  cosco  =  — 1. 
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Fig.  52.  Curves  P  -=  0,  inner  stable  and  outer  unstable  periodic  orbits  for  various 

values  of  a. 


(2)  The  upper  set  of  curves  (B)  correspond  to  cosco  =  +  1.  For  a  given  A  and  g 
there  are  in  general  two  values  of  a;  except  at  the  turning  point  which  corresponds 
to  a  Aq  such  that  gQ  =  cos"^''  Aq  .  In  the  r~g  plane,  this  corresponds  to  the  tangential 
value  g^  for  cos  co  =  +  1.  The  curve  for  g  =  I  touches  the  equator  (A  =  0)  at 
X  ^  0.7  (i.e.  r  =  1)  and  higher  the  value  of  Amin,  lower  the  g  and  x  values  which 
again  confirms  the  shape  of  cosco  +  1  as  A  increases. 

We  shall  now  consider  the  contours  P  =  a  constant  in  the  a;  -  A  plane  for  a 
given  gr(=  0.89),  taking  P  as  the  third  dimension.  Remembering  that  the  unit 
in  which  r  is  measured  depends  upon  the  particle  energy  as  we  have  already  seen, 
the  position  of  the  earth's  surface  is  defined  by  rQ{p).  If  we  wish  to  plot  any 
trajectory,  we  must  start  with  some  point  with  a  finite  {rQ,?.Q,q))  and  given  g 
momentum  values  and  trace  the  path  of  the  particle  "forward"  or  "backward"  by 
increasing  or  decreasing  c  This  is  the  only  way  of  plotting  trajectories  of  particles, 
since  there  is  no  meaning  in  plotting  a  trajectory  from  an  infinite  distance  which  is 
the  real  starting  point  for  cosmic  rays. 

EPCR      25 
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To  study  the  path  of  a  particle,  we  first*  take  a  point  {Xq,}.q)  and  assume  a 
certain  direction  of  motion  for  the  particle  at  this  point.  If  the  particle  at  this 
point  experiences  an  acceleration  to  the  right  and  continues  to  do  so  for  x^  Xq, 
it  can  never  return  to  an  a:- value  less  than  Xq  .  Thus  if  (Xq  ,  Aq)  represents  a  point 
on  the  earth,  the  trajectory  can  never  return  to  the  earth.  Of  course  we  have 
assumed  that  the  particle  is  moving  exactly  in  the  opposite  direction  to  which 
the  cosmic  ray  particles  will  arrive  and  this  is  equivalent  to  our  previous  statement 


-0-2 


-0-6  -0-4  -0-2 


Fig.  53.  Contour  diagram  of  the  function  P{x,  A)  for  g  =  0.89  {a  =  0.10). 

that  a  trajectory  can  be  traced  forwards  or  backwards  since  the  equations  of 
motion  are  not  altered  by  replacing  a  by  — a.  A  particular  example  of  such  a 
point  is  Xq  =  0.54,  where  if  a  particle  goes  towards  the  right  in  any  arbitrary 
direction,  it  can  never  return  to  the  initial  value.  Particles  arriving  from  infinity 
at  Xq  would  never  have  intersected  the  earth  earlier.  Also  for  a  given  X  we  can 
find  the  value  ar^axC^)  ^^^  ^minl-^)  for  which  P  is  a  maximum  and  minimum  re- 
spectively. The  loci  of  points  Xj^^^^  (A)  and  x^i^^  (A)  as  we  vary  X  will  be  called  the 
curves  of  maximum  and  minimum  P. 

In  the  region  between  the  curves  of  maximum  and  minimum  P,  the  x  component 
of  the  acceleration  is  directed  towards  the  left.  In  this  case  for  a  particle  starting 
out  from  any  (XqXq)  in  the  region,  there  is  a  possibility  of  returning  to  its  initial 
value  depending  upon  its  momentum  and  its  initial  direction.  Thus  it  is  clear 
that  the  trajectory  of  a  particle  which  never  enters  a  region  of  negative  acceleration 
(and  of  course  negative  energy)  will  be  certainly  allowed  and  hence  will  not  be 
subject  to  the  shadow  effect  of  the  earth.  We  shall  now  obtain  the  condition  for 
a  trajectory  to  pass  only  through  regions  of  positive  acceleration. 
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Since  the  question  of  the  shadow  effect  of  the  earth  arises  only  when  a  particle 
experiences  a  negative  acceleration  in  some  region  along  its  trajectory,  we  now 
plot  the  curve  d  Fjdx  =  0  in  the  r-g  plane  for  a  given  X.  This  is  given  by 

cos^  X 
2g  =  r^^ (6) 

r  ^ 

and  marks  out  the  region  of  negative  acceleration  in  the  r-g  plane,  as  is  seen  in 

the  figure.  The  following  geometrical  features  for  a  given  X  are  obvious  (see  Fig.  51 )  .^ 

(i)  The  two  branches  correspond  to  Pj^^x  and  Pj^in  and  the  tangential  value  of  g^ 

is  0.8774  cos^/^A  at  r^  =  — — ^y^.  Thus  g^  is  always  less  than  g^  =  (cos'^^A). 

l-dlb       ^p 

(ii)  The  curves  cosco  =  +  1  and-_— =  0  intersect  atg  =  cos*/^A  and  r  =  cos'/='.?., 

i.e.  at  (^;i,  r;i).  ^  ^^ 

(iii)  The  cos  o)  value  on  any  point  on  the  curve  — —  =  0  is  given  by 

cos  CO  =  sin  6  =  r^/cosA 

as  obtained  by  substituting  for  2^  in  the  general  formula  for  cosco.  The  basic 
features  of  the  simple  shadow  cone  may  now  be  qualitatively  explained  by 
applying  arguments  similar  to  those  used  in  the  case  of  negative  energy. 

(a)  Since  the  trajectory  of  a  particle  is  not  subject  to  the  shadow  effect  if  it 
never  enters  a  region  of  negative  acceleration,  the  critical  value  of  g  in  analogy 
with  the  arguments  for  the  main  cone  is  now  given  by 

gr.j  =  0.8774  cos'/^o 

and  for  particles  with  r^  <  cos'^^  Xq  trajectories  with  X  ^  Xq  are  always  allowed. 

(b)  However  a  condition  similar  to  the  Stormer  condition  (i.e.  g  ^  1  m.  this 

case  gs2  =  0.8774)  has  no  particular  significance  since  once  within  a  region  of 

negative  acceleration  there  is  no  simple  method  of  determining  the  allowed  and 

forbidden  directions.  This  is  due  to  the  fact  that  the  trajectory  of  a  particle 

subject  to  a  negative  acceleration  is  not  necessarily  forbidden  since  before  the 

particle  velocity  becomes  zero  it  may  enter  a  region  of  positive  acceleration  and 

will  therefore  be  allowed.  It  is  this  complicated  structure  which  gives  rise  to  the 

penumbra.  Thus  we  have  the  following  values  of  g  which  define  the  boundary 

points.  ^  . 

.2  cos  /q. 

(1)  gr  =  1  which  defines  the  Stormer  cone  with  cos  co  =  sin  6  == j—  —  — ^ — 

within  which  all  directions  are  forbidden.  o  o  o 

(2)  gQ  =  cos'/"  Xq  which  defines  the  boundary  of  the  main  cone  within  which 
all  directions  are  allowed,  except  those  due  to  the  simple  shadow  cone. 

(3)  The  cone  with  g^  =--  0.8774  cos'I^Xq  which  defines  a  cone  within  which  all 
directions  are  certainly  allowed. 

(4)  The  cone  — ^  =  cos  co  which  is  either  more  or  less  restrictive  than  the 

cos  Ac 


1  G.  Lemaitre  and  M.  S.  Vallarta,  ibid. 
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main  cone  depending  upon  the  r  value,  i.e.  the  energy  of  the  particle.  For  r,, 
values  below  r,  9^1/3  1 

— V  > r^  (7) 

since  the  line  parallel  to  the  g  axis  cuts  the  tangent  g  =  cos*/«  Aq  first  and  then 
the  curve  -r —  =  0.  But  for  r^  between  r^  and  cos*'»  Xq 


rj  2co8'/«Ao       cosAq 


cosAq 


(8) 


The  shadow  effect  can  be  explained  in  a  slightly  different  way  as  follows: 

Taking  a  point  on  the  earth  {xq,  Aq),  the  longitude  being  at  present  irrelevant 
(since  the  situation  is  symmetrical  with  respect  to  longitude  about  the  dipole) 


Fig.  54.  Simple  orbit  of  the  second  kind  (B). 
Vertical  line  represents  the  earth's  surface. 
Orbit  A  is  in  shadow.  Orbit  C,  if  non-re- 
entrant gives  an  allowed  direction  at  0  cor- 
responding to  the  interior  of  the  main  cone ; 
if  re-entrant,  orbit  C  may  or  may  not  be  in 
shadow  according  to  the  positions  of  its  sub- 
sequent minima  of  x. 


Fig.  55.  Family  of  simple  orbits 

of  the  second  kind,  x  =  constant 

vertical    line    represents    earth's 

surface. 
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it  is  only  necessary  to  obtain  for  each  {Xq  ,  Xq)  the  family  of  all  orbits  possessing 
sections  OT  like  that  of  orbit  B  in  Fig.  54.   This  is  a  one  parameter  family, 

7]  being  defined  by  tanrj  =  -^ —  1 

defined  by  this  family  provide  the  simple  shadow  cone  for  the  observation  point 
(A,  r).  The  orbits  in  question  have  been  obtained  with  the  aid  of  Bush  differential 
analyses.  To  facilitate  the  finding  of  points  on  the  earth  at  which  the  shadow 
effect  is  felt,  we  adopt  the  following  procedure. 


0-425 


Fig.  56.  Family  of  orbits  which  start  out  from  one  point. 


To  find  the  range  of  A,  when  x  is  specified  as  Xq  when  it  denotes  a  point  on  the 
earth  (the  value  of  x  for  the  earth  is  determined  by  the  momentum)  we  study 
the  behaviour  of  the  trajectories  which  intersect  the  earth  again,  as  we  vary  A. 
We  have  noted  that  the  trajectory  stemming  from  the  point  0  (Xq,  Xq)  touches 
the  line  x  =  Xq  at  {Xq,  X]).  No  shadow  effect  will  be  felt  if  there  exists  no  Af  cor- 
responding to  a  given  Aq.  We  shall  now  investigate  the  limits  of  At.  This  is  done 
by  studying  trajectories  which  start  tangential  to  a;=  otq  at  At.  As  we  increase  At, 
we  reach  a  point  Aj  where  the  trajectory  is  essentially  to  the  left  of  a;  =  Xf,.  The 

dP 


point  {Xq  ,  Ai)  represents  obviously  the  point  on  the  contour  at  which 


dx 


=  0. 


( 


The  a;- acceleration  of  the  particles  starting  tangentially  below  A^  is  initially 
towards  the  right  and  above  A^ ,  towards  left.  As  we  decrease  A^,  we  reach  a  point  Ag 
such  that  a  trajectory  tangential  to  x  =  Xq  reaches  the  boundary  P  =  0  and 
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reverses.  This  is  called  the  self-reversing  orbit  and  at  Ag  =  0,  it  can  be  identified 

to  be  the  outer  principal  periodic  orbit.  To  obtain  the  points  A^  and  Ag  for  various 

dP 
values  of  g,  we  have  drawn  the  loci  of  zero  a;-acceleration  or  — —  =  0  and  zero 

;r- velocity  which  are  shown  in  Fig.  51. 


dx 
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Fig.  57.  Family  of  asymptotic  orbits  for  g  =  0.85. 


Fig.  58.  The  observer's  hemisphere  at 
latitude  A  showing  schematically  the 
allowed    and  forbidden  directions  of 
incidence  for  a  given  rigidity. 


Thus  the  existence  of  the  shadow  cone  depends  upon  the  possibility  of  finding 
a  At.  If  Ai  and  Ag  coincide,  no  shadow  effect  can  be  felt  on  the  earth  with  respect 
to  the  particles  of  that  g  value  which  yields  the  limiting  A  values.  If  there  exists 
no  self-reversing  orbit,  i.e.  (x,  Ag)  lies  on  P{x,  A)  and  if  A^  and  Ag  coincide  the  point  Ag 
corresponds  to  zero  a:-velocity  and  no  analytical  expression  can  be  found.  Since 
an  analytical  expression  can  be  found  for  A^  and  not  for  Ag  we  merely  find  the  value 
of  g  when  A^  lies  on  P{x,  X)  =  0 ;  we  observe  that  A^  =  Ag  is  also  the  poiat  of  inter- 
d^x 


section  of 


da2 


0  and  P  = 

dP 
dx 


0. 


=  0     yields     2g  =  r^ 


cos^  A 
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and  P  =  0  yields  r^        I  cos^  X        X^ 


/cos^/t         Y 


4g^ 

These  two  curves  intersect  for  r^  =  cosA,  g^  =  r^. 

The  above  discussion  is  only  to  facilitate  the  finding  of  points  at  which  the 
shadow  effect  can  be  felt.  For  each  chosen  value  of  g  and  each  chosen  value  of  x, 
we  obtain  a  curve  giving  ^  as  a  function  of  A.  From  these  curves,  diagrams  are 
then  constructed  giving  ry  as  a  function  of  r  for  each  A.  Once  we  are  in  possession 
of  these  curves,  we  are  in  a  position  to  plot  the  simple  shadow  cone  at  a  place 
of  given  latitude  A.  The  coordinates  of  a  point  on  the  loci  are  given  by  smO, 
sinrj  cos  6. 

In  addition  we  have  a  class  of  orbits  which  do  not  extend  to  infinity  at  all, 

which  are  defined  for  obvious  reasons  as  periodic  orbits.  The  simplest  periodic 

orbits  we  can  think  of  are  those  in  the  equatorial  plane.  For  example  we  can 

show  that  a  particle  with  a  given  ^- value  can  describe  a  circle  of  radius  r  =^  g 

without  moving  out  of  the  orbit  in  the  equatorial  plane.  This  follows  immediately 

dw 
from  the  definition  since  g  ==  r^— — .  For  a  circular  motion  rdw  =  ds  and  hence 

OS  ^ 

r  =  g.  In  particular  gr=l,r==lisa  periodic  orbit.  This  periodic  orbit  can  be 
represented  by  a  point  in  the  r-X  plane  since  the  particle  is  confined  to  the 
equatorial  plane.  A  few  important  features  corresponding  to  gr  =  1,  r  =  1  can 
be  mentioned.  Examining  the  equation  (1.6)  for  the  motion  in  the  equatorial 
plane,  we  find  since 

2  =  2'  =  0     and    Q  =  r 


(p  =  (Po  = 


/(-^)- 


1  /^       1  r 


so  that  (p,  the  angle  described  by  the  meridian  plane  since  the  particle  started 
its  course  from  infinity  tends  to  infinity  as  r  ^.  1  for  g  =  I.  This  obviously  means 
that  the  particle  spirals  round  the  circle  r  =  1  an  infinite  number  of  times  and  so 
if  we  ask  for  the  angle  described  about  the  meridian  plane  as  r->  1,  it  must 
clearly  be  infinite.  This  would  mean  that  if  a  periodic  orbit  is  described  by  a 
particle,  any  slight  displacement  would  make  it  describe  the  asymptotic  orbit  and 
it  may  reach  infinity.  Thus  the  orbit  g=l,  r=l  is  "unstable".  Any  periodic  orbit 
is  called  unstable  if  there  exist  asymptotic  orbits  which  can  be  traced  to  infinity. 
For  simplicity  we  have  mentioned  a  class  of  periodic  orbits  in  the  equatorial 
plane.  Stormer  has  shown  that  three-dimensional  periodic  orbits  can  also  exist 
and  he  has  derived  in  particular  the  two  principal  periodic  orbits  whose  charac- 
teristics are  as  follows.  They  reach  a  point  {x,  X)  at  which  P  =  0,  i.e.  the  velocities 
become  zero.  These  two  are  represented  in  the  contour  diagrams.  Note  that  the 
three-dimensional  periodic  motion  represents  an  oscillatory  motion  along  curves  1 
and  II  in  (Fig.  51).  The  orbit  nearer  the  dipole  is  stable  while  the  other  is  unstable. 
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4.  LATITUDE  EFFECT 

In  view  of  the  multiplicity  of  conditions  imposed  and  the  complicated  nature 
of  the  orbits,  it  may  seem  difficult  to  determine  the  variation  of  the  directional 
intensity  of  radiation.  But  on  the  assumption  that  the  particles  are  initially 
isotropic  in  space,  it  was  realized  by  Lemaitre  and  Vallarta  that  as  an  immediate 
consequence  of  the  Liouville  theorem  in  classical  dynamics,  the  spatial  distri- 
bution remains  isotropic  in  allowed  directions.  Hence  to  compute  the  intensity 
of  radiation  at  any  latitude  we  need  only  know  the  solid  angle  corresponding  to 
the  total  allowed  cone. 

We  shall  first  state  the  Liouville  theorem  in  a  form  which  immediately  reveals 
how  the  initial  assumption  of  isotropy  turns  out  to  be  fortuitous  in  the  application  ^ 
of  the  theorem. 

Let  the  motion  of  a  particle  be  represented  by  a  trajectory  in  a  six-dimensional 
phase  space.  If  it  is  represented  by  a  point  Pq  in  phase  space  at  time  <  =  0,  it 
goes  over  to  P<  at  t.  Similarly  every  point  goes  over  to  a  corresponding  point,  the 
transformation  being  one  to  one.  If  Pq  is  a  measurable  set  of  points,  then  the 
measure  of  Pq  coincides  with  the  measure  of  Pi,  i.e.  the  measure  of  measurable 
point  sets  is  an  invariant  under  this  transformation.  If  now  we  have  an  isotropic 
distribution  of  particles,  it  corresponds  to  an  uniform  distribution  of  points  in 
phase  space  so  that  once  we  assume  particles  are  isotropic  in  one  region  of  external 
space,  the  motion  is  such  that  isotropy  is  maintained  in  all  allowed  directions. 
Thus  the  total  intensity  of  radiation  corresponding  to  a  particular  momentum 
is  determined  if  we  know  the  angle  of  the  allowed  cone. 

The  total  density  of  radiation  corresponding  to  particles  of  momentum  f,  at 
a  given  latitude  X  is  thus  given  by 

i{v^X)=-  I  i(e,p,X)dQ,  (1) 

where  dQQ  is  the  solid  angle  around  d  the  directional  intensity  and  A  {p,  X)  defines 
the  total  allowed  cone.  The  overall  intensity  for  all  momenta 

/«=//(?,  A) dp,  (2) 

and  the  number  density  of  particles  of  type  j  is 


^^ji^^Uq, 


(3) 


If  the  distribution  of  particles  is  isotropic  for  all  velocities 


^,-  =  — /,■  (4) 


and  the  flux  of  particles  of  type  j  is 

Fj=  fij  cos6jdQo  =  7iIj  =  -^  VjNj.  (5) 

The  calculated  values  appear  to  be  in  fair  agreement  with  the  observed  ones. 


CHAPTER  XII 

INTERACTION  OF  COSMIC  RAYS  WITH  MATTER 

1.  GENERAL  DESCRIPTION 

Since  primary  cosmic  radiation  consists  of  nuclei  ranging  in  energies  from  10* 
to  10^®  eV,  the  study  of  cosmic  ray  phenomena  implies  the  study  of  the  inter- 
actions of  such  fast  particles  with  air  nuclei  or  more  generally  with  matter.  This 
falls  broadly  into  two  classes : 

(i)  Relating  to  individual  collisions  between  the  incident  particles  and  air 
nuclei,  or  the  secondaries  produced  in  such  colUsions  and  the  air  nuclei.  The 
study  is  based  on  the  theory  of  elementary  particle  interactions  according  to  the 
existing  state  of  quantum  mechanics  which  Part  I  deals  with  in  detail. 

(ii)  Relating  to  cascade  phenomena,  i.e.  the  computation  of  the  cumulative 
effect  of  these  fundamental  particle  interactions  over  finite  thicknesses  of  matter 
assuming  the  cross-sections  for  the  individual  processes. 

We  will  now  deal  with  these  elementary  particle  interactions  with  special 
reference  to  cosmic  rays  and  the  present  status  of  cascade  theory.  Since  the 
beginning  of  the  study  of  cosmic  rays,  cascade  phenomena  have  claimed  the 
attention  of  both  the  theorist  and  the  experimenter.  For  obvious  reasons  the 
different  features  of  this  phenomenon  have  been  studied  separately  as  for  example 
the  longitudinal  development  of  nucleon  cascades  arising  from  nuclear  inter- 
actions, soft  cascades  from  electromagnetic  interactions,  meson  production 
without  reference  to  other  particles  and  lateral  spread  ignoring  other  features. 
In  principle  all  these  features  must  be  imbedded  in  a  cascade  generated  by  such 
high  energy  particles  for  a  better  understanding  of  the  high  energy  component 
of  cosmic  rays.  However  a  quantitative  study  of  this  is  a  formidable  task  for 
two  reasons : 

( 1 )  Exact  cross- sections  for  all  but  electromagnetic  phenomena  are  not  available, 

(2)  The  inclusion  of  all  the  processes  makes  the  problems  mathematically 
intractable  and  numerical  computation  almost  impossible. 

As  an  introduction  to  the  above  study  we  shall  here  give  a  descriptive  account 
of  cosmic  ray  phenomena  associated  with  the  passage  of  a  fast  primary  through 
the  atmosphere. 

The  phenomena  can  be  described  in  most  general  terms  as  follows:  A  fast 
primary  nucleon  enters  the  atmosphere  at  any  arbitrary  angle  to  the  zenith.  It 
collides  with  the  air  nuclei  and  in  a  single  nuclear  colhsion  produces  a  sheaf  of 
mesons  (charged  and  neutral),  nucleons,  sometimes  heavier  fragments  and  strange 
particles.  The  charged  mesons  either  decay  into  /^-mesons  or  interact  with  air 

393 


394  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

nuclei,  the  interaction  between  the  meson  and  the  nucleus  being  of  the  same 
type  as  a  nucleon-nucleus  collision.  The  neutral  meson  either  interacts  with  the 
nucleus,  or  what  is  more  probable,  decays  almost  immediately  into  two  highly 
energetic  gamma  rays  which  generate  the  soft  cascades.  Among  the  strange 
particles  the  charged  A'-mesons,  if  they  have  high  energies,  interact  with  the 
nuclei  producing  the  same  type  of  particles  as  in  a  pion-nucleon  collision  at  low 
energies.  The  A^+-mesons  can  undergo  only  scattering  by  the  nucleons  and  ultimately 
decay  into  pions  or /i-mesons,  the  abundant  mode  being  A^  2  decay.  The  ^--mesons 
at  such  low  energies  can  undergo  a  variety  of  interactions  with  nuclei  resulting 
in  the  production  of  hyperons  and  jr-mesons.  The  K^  particles  will  decay  into 
two  TT-mesons.  The  hyperons  at  very  high  energies  reproduce  the  nuclear  collision 
phenomena  while  at  low  energies  decay  or  interact  with  nuclei  forming  hyper- 
fragments.  The  hyperfragments  in  turn  decay  into  ordinary  nuclei  with  or  with- 
out emission  of  pions. 

High  energy  electrons  are  not  only  produced  in  the  soft  cascades  generated  by 
photons  but  also  by  the  decay  of //-mesons.  The  //-mesons  which  are  highly  ener- 
getic and  weakly  interacting  mostly  decay  but  since  the  decay  time  is  comparable 
to  the  time  for  traversing  the  atmosphere  they  comprise  the  penetrating  compo- 
nents at  sea  level. 

Three  additional  features  have  to  be  included  in  the  description  of  the  develop- 
ment of  such  cascades : 

(1)  The  loss  of  energy  by  ionization  of  charged  particles. 

(2)  The  angular  distribution  of  particles  produced  in  any  collision  (i.e.  they 
need  not  be  collimated). 

(3)  Lateral  spread  due  to  multiple  scattering. 

The  theories  of  nucleon-nucleon  collisions  are  in  a  primitive  stage  in  view  of 
the  strong  coupling  of  the  meson  and  nucleon  fields  and  the  consequent  failure 
of  the  standard  perturbation  methods.  Even  if  we  do  not  invoke  perturbation 
theory,  there  are  serious  limitations  to  any  field  theoretic  calculation  for  the 
production  of  more  than  one  meson  in  a  nucleon-nucleon  collision. 

Only  statistical  approaches  of  doubtful  validity  in  their  quantitative  estimates 
are  available.  As  regards  the  interaction  of  the  strange  particles  the  nature  of 
the  coupling  is  in  a  speculative  stage  and  the  calculation  at  such  high  energies 
available  in  the  cosmic  rays  is  beyond  hope.  Ad  hoc  assumptions  have  to  be  made 
about  the  particles  at  the  point  of  production. 

The  processes  described  above  can  be  schematically  represented  by  the  follow- 
ing diagram  in  which  a  primary  particle  (e.g.  p,  oc,  etc.)  enters  at  A  and  collides 
with  a  nucleus  in  the  atmosphere.  The  following  conventions  have  been  adopted 
in  the  diagram : 

(i)  Particles  produced  in  a  single  collision  are  represented  along  a  horizontal 
line. 

(ii)  The  decay  products  of  a  particle  are  represented  by  lines  diverging  from  a 
point. 
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2.  i^-MESON  IxNTERACTIONS  IN  COSMIC  RAYSi 

The  ^-meson  is  one  of  the  secondary  particles  produced  by  the  passage  of 
cosmic  rays  tlirough  the  atmosphere.  Historically  the  //-meson  was  discovered 
earlier  than  the  .T-meson  and  originally  mistaken  for  the  Yukawa  particle  till 
the  latt<»r  was  identified  to  be  the  jr-meson.  The  cosmic  ray  //-mesons  are  assumed 
to  originate  from  the  decay  of  the  Ti-mesons  in  flight,  the  pions  being  produced 
by  the  int^'raction  of  the  primary  nucleonic  component  with  the  nuclei  of  the 
atmosphere. 

The  //-mesons  have  weak  interaction  with  the  nuclei  and  thus  form  the  pene- 
trating component  of  cosmic  radiation.  They  lose  energy  through  ionization  and 
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Fig.  60.  Spectrum  of  electrons  from  the  decay  of  negative  /z-mesons. 


decay  either  in  flight  or  at  rest  according  to  the  scheme  fx^  ->  e^  +  r  +  v.  How- 
ever the  slow  negative  /t-mesons  can  be  captured  to  form  mesic  atoms.  Such  bound 
mesons  can  either  decay  spontaneously  or  be  captured  by  the  protons  in  the 
nucleus  according  to  the  scheme  ^  +  //"->/i+  v.  Such  nuclear  capture  is  also 
called  forced  decay.  The  probability  of  capture  is  proportional  to  the  square 
of  the  amplitude  of  the  /<-meson  wave  function  at  the  nucleus  and  the  total 
number  of  protons.  If  we  assume  that  the  mesic  atom  is  hydrogen-like,  the 
probability  P  is  given  by 

Pr^ZiZ^'^'fr^ZK  (1) 


^  "The  interaction  of  /^-mesons  with  matter",  by  G.  N.  Fowler  and  A.  W.  Wolfendale, 
Progress  in  Cosmic  Ray  Physics  and  Elementary  Particles,  Vol.  4,  p.  107  (1958).  This  article 
also  contains  extensive  references. 
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A  rough  estimate  gives  the  lifetime  of  the  decay  to  be 

g'— r-.  (2) 


1  5000(210)3    ^m^Z^ 


"^capture  ^ 

The  apparent  lifetime  is  given  by 

1  1  1 


(3) 

"^  ^decay  "^capture 

Eor  Z  =^  II  the  decay  and  captme  lifetimes  are  equal.  The  variation  of //-meson 
with  depth  is  determined  solely  by  the  spectrum  of  the  parent  mesons,  their 
decay  processes  and  by  their  electromagnetic  interaction  with  nuclei. 

If  we  consider  the  //-meson  as  a  heavy  electron  with  no  other  distinct  charac- 
teristics, we  have  only  to  replace  the  mass  of  the  electron  nig  by  that  of  the  meson 
m^^  in  the  cross-sections  for  electrodynamics  processes.  However  this  is  not  true 
though  it  is  well  realized  that  /t-meson  interaction  with  the  nuclei  is  essentially 
electromagnetic.  Following  Fowler  et  al}  we  shall  roughly  classify  these  inter- 
actions as  classical,  i.e.  known  and  non-classical. 

Classical  electromagnetic  interaction :   ionization 

The  Bethe  formula  for  the  loss  of  energy  of  a  heavy  particle  of  velocity  v 
through  ionization  is 


dE 


dx 


=  — ^ log  ^i ^^-  —  p2   erg/cm .  (4) 


}]  being  the  number  of  electrons  per  cm^  and  /  the  average  ionization  potential 
(in  ergs)  of  these  electrons  in  the  absorber  and  ^  =  f/c.  This  formula  differs 
from  that  for  ionization  due  to  electrons  in  the  following  respects.  We 
have  assumed  m^,  to  be  much  higher  than  the  electronic  mass  so  that  the  meson 
traverses  the  matter  practically  undeflected,  i.e.  the  transverse  component  of 
the  momentum  acquired  during  collision  is  negligible.  The  two  particles  being 
different,  we  need  not  consider  exchange  effects.  Experimentally  observed 
values  agree  exactly  with  those  given  by  theory. 

Collisions  involving  free  electrons 

The  electrons  are  classed  as  d  rays  (10-100  keV)  or  knock-on  electrons  (MeV) 
depending  on  their  energy.  These  processes  amount  to  electron-electron  inter- 
action (with  a  ju  replacing  an  electron)  involving  transfer  of  energy  and 
momentum. 

The  mechanism  of  production  of  d  rays  is  as  follows.  A  slow  heavy  particle 
ionizes  an  atom.  An  electron  may  be  ejected  with  sufficient  energy  to  cause 
secondary  ionizations  and  be  observable  as  d  rays.  We  then  have  rays  travelling 


^  Fowler  et  al.,  ibid. 
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away  from  the  path  of  the  incident  non-relativistic  particle.  If  the  incident 
heavy  particle  is  relativistic,  these  rays  have  a  momentum  component  along  its 
path.  The  cross-section  for  6  rays  of  energy  between  co  and  o)  +  dco  is 

N{(o)  do)  — rTft      ^c* — T-dx  (o) 

where  ^  is  the  /i-meson  velocity  and  r^  the  classical  electron  radius.  Numerical 
calculations  using  this  formula  give  approximately  1506  rays  above  20keV 
in  argon  at  52  g/cm^  density  which  agrees  fairly  well  with  experimental  value. 
When  the  energies  of  the  electrons  are  higher,  the  spin  of  the  /^-meson  should 
be  taken  into  account  while  computing  the  cross-section.  Assuming  spin  ^/g, 
it  amounts  to  considering  the  electromagnetic  interaction  between  particles 
of  different  masses.  We  have 

"''"*  E,,^^  =  Ell{E^  +  mJ.c^l2m)  (7) 

the  maximum  energy  transferable  by  a  /^-meson  of  energy  JS7^,.  The  total  observed 
cross-section  of  20  x  10"^®  cm^/electron  agrees  with  the  theoretical  value  of 
18.4  X  10"^®  cm^/electron.  However,  there  was  an  appreciable  excess  of  high 
energy  electrons  over  that  predicted  by  theory.  Attempts  were  made  to  explain 
this  by  associating  an  anomalous  magnetic  moment  d  with  the  ju.  The  cross- 
section  in  this  case  reads 

g(E^.  8)de  =  d^n (—  cA  (1  -  £)  —  (8) 

\m^     J  e 

E 

where  e  =  -p—  while  the  cross-section  for  a  normal  Dirac  particle  is 

^o(E,,  .)d.  =  2J^y^.^.^fl-.  +  a.  (9) 

^  \  m^c^  j    m        E^        8^   \  2  / 

But  general  experimental  evidence  seems  to  rule  out  a  large  anomalous  magnetic 
moment.  There  may  be  corrections  to  the  cross-section  due  to  emission  of  real 
photons,  i.e.  by  Bremsstrahlung. 

Photoproduction 

The  observed  cross-section  for  photoproduction  of  //-meson  pairs  is  much 
higher  than  that  given  by  the  Bethe-Heitler  formula.^  To  explain  the  above 
cross- section  it  was  proposed  that  the  ^  pairs  were  not  all  due  to  photoproduction 
but  some  were  due  to  direct  production  by  jr^-mesons.  Such  a  hypothesis  follows 
from  the  model  proposed  by  WentzeP,  in  which  all  pions  can  be  considered 

^  See  p.  258,  Quantum  Theory  of  Radiation,  W.  Heitler  (Oxford  Press)  (1954). 

2  G.  Wentzel,  Phys.  Rev.  79,  710  (1950).  It  is  to  be  noticed  that  according  to  the  recent 
ideas  on  weak  interactions  n^  cannot  undergo  decay  into  //  pair  as  it  will  involve  violation 
of  angular  momentum  in  n^  rest  system. 
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composite  systems  of  ^  pairs.  However  this  gave  ^  pair  production  to  be  nearly 
6  per  cent  oin  production  whereas  it  should  be  less  than  0.1  per  cent  for  agreement 
with  experiment.  Probability  of  production  of /^  pairs  by  annihilation  of  electron- 
positron  pairs  was  considered.  This  turned  out  to  be  ^/7oth  the  probability  of 
two  photon  annihilation  of  a  40  keV  positron  and  an  electron  at  rest. 

Anomalous  (i.e.  not  Rutherford)  scattering  of /^-mesons  occurs  due  to  the  finite 
size  the  nucleus.  At  large  angles  there  is  deviation  from  Coulomb-scattering  by 
an  uniform  nucleus.  Anomalous  scattering  corresponds  to  multiple  Moliere-scatter- 
ing  within  the  nucleus. 

As  we  have  already  mentioned  by  non-classical  is  meant  the  electromagnetic 
interactions  whose  cross-section  cannot  be  directly  obtained,  like  the  production 
of  stars  and  showers.  The  energy  of  the  ^-meson  may  be  just  sufficient  to  eject 
a  single  proton  from  a  nucleus  or  produce  an  "evaporation"  (cold)  star  or  produce 
mesons  and  characteristic  showers.  All  these  processes  are  due  to  the  interaction 
of  the  electromagnetic  field  of  the  electron  with  the  //-meson  and  can  be  explained 
on  the  basis  of  a  "virtual"  photon  flux. 

We  shall  now  give  a  brief  outline  of  this  process.  Consider  a  high  energy /t-meson 
[v  ^  c)  approaching  a  heavy  stationary  nucleus  of  charge  Ze.  In  the  rest  system 
of  the  //-meson  the  nucleus  moves  with  a  velocity  approaching  c.  Hence  the  elec- 
tric field  E  of  the  nucleus  is  contracted  and  the  perpendicular  magnetic  field  H 
is  about  the  same  magnitude.  Thus  the  nucleus  looks  like  a  plane  electromagnetic 
wave  (corresponding  to  photons  of  various  frequencies)  to  the  //-meson.  These 
photons  undergo  Compton-scattering  by  the  //-meson.  Referring  back  to  the 
laboratory  system  (the  rest-system  of  the  nucleus)  these  scattered  photons  appear 
to  have  been  emitted  by  the  meson.  So  in  all  the  calculations  the  //-meson  is 
effectively  replaced  by  a  "virtual"  photon  flux.  By  "virtual"  we  mean  an  "equi- 
valent" photon  beam  which  has  been  obtained  simply  by  a  Lorentz  trans- 
formation. 

The  equivalent  photon  spectrum  is  given  by 

N,^dkf  =  ^{znKI{z)~Kl{z)]^2zK,{z)K,{z)}^  (10) 

kf 
where  Kq,  K^  are  the  usual  Bessel  functions;  z  =  ni  b^nin'^^  ^min  being  the  least 

impact  parameter  in  this  process.  For  very  high  energies  E  >  m,, ,  6^}^  ^  R  the 
radius  of  the  region  over  which  electromagnetic  interaction  is  strong.  We  then 
have 

i^.d.V^^jlog^--0.38}d.,  (11) 

when  the  perturbed  system  is  a  single  nucleon  R  is  presumably  the  Compton 
wavelength  and 


Nkfdkj^^  hog 0.38   dZ:;.  (12) 
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Thus  cross-section  for  proton  production  is  2.4  x  IQ-*®  cm^  per  nucieon  while 
that  for  star  production  is 

where  £  =^  150  MeV,  the  minimum  photon  energy  required  to  produce  a  star 
with  three  prongs. 

Eq  '-^  energy  loss  up  to  the  depth  in  question.  Prong  distribution  has  also 
been  given. 

Highly  energetic  /i-mesons  produce  pions  and  showers.  There  is  also  a  possibility 
of  A'-mesons  being  produced  in  /^-meson  interaction. 

3.  THEORIES  OF  MULTIPLE  PRODUCTION^ 

Experimental  evidence  from  the  study  of  (i)  jet  showers  and  large  star  events 
in  nuclear  emulsions  and  (ii)  extensive  air  showers  and  in  particular  the  penetrat- 
ing component  in  cloud  chamber  experiments  suggested  the  possibility  of  the 
production  of  more  than  one  particle  in  a  single  collision  between  the  nuclei 
at  extremely  high  energies.  Two  types  of  attempts  have  been  made  to  explain 
this  phenomenon. 

It  is  assumed  that  only  one  meson  is  produced  in  a  single  nucleon-nucleon 
collision  but  a  nucieon  when  passing  through  a  nucleus  produces  an  internuclear 
cascade  by  successive  collisions  with  the  nucleons  in  the  nucleus.  For  simplicity 
the  collisions  inside  the  nucleus  are  considered  independent  and  the  standard 
methods  of  cascade  theory  are  used,  the  total  thickness  of  "matter"  traversed 
being  the  diameter  of  the  nucleus.  Unlike  the  case  of  a  cascade  in  finite  thicknesses 
of  matter  in  the  gross,  it  is  not  possible  to  analyse  the  development  of  the  inter- 
nuclear cascade  at  various  "thicknesses"  but  we  can  only  make  calculations  on 
the  basis  that  the  nucieon  has  passed  through  the  entire  nucleus.  In  other  words 
the  cascade  hjrpothesis  is  just  used  to  compute^  the  cross-sections  for  the  pro- 
duction of  particles  in  the  various  energy  ranges  in  a  single  encounter  between 
a  nucieon  and  a  nucleus. 

Alternatively  it  is  assumed  that  even  in  a  nucleon-nucleon  collision  multiple 
production  is  possible.  The  same  considerations  apply  to  a  collision  with  a  nucleus. 
Since  the  nucleus  is  closely  packed  with  nucleons  the  concept  of  a  cascade  due 
to  successive  collisions  loses  meaning  and  the  collision  has  to  be  considered  as 
a  single  process. 

Recent  experiments  on  collisions  by  artificially  accelerated  nucleons  clearly 
show  that  many  mesons  can  be  produced  even  in  a  single  encounter.  Perhaps 


1  The  theories  of  Multiple  Particle  Production  by  Ziro  Koba  and  Shuji  Takagi,  Fortschr. 
der  Physik  7,  1(1959).  This  article  also  contains  quite  an  extensive  bibliography.  See  also 
Elementary  Particles  by  Enrico  Fermi,  Yale  University  Press.  We  wish  to  thank  Professor 
Z.  Koba  for  his  lectures  while  he  visited  Madras  on  "Theories  of  Multiple  Production"  which 
clarified  many  difficulties. 

-  For  a  detailed  discussion  see  chapter  XIII. 
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very  high  multiplicities  in  collisions  with  nuclei  may  necessitate  a  cascade  hypo- 
thesis with  the  nucleus  though  the  independence  of  successive  collisions  may  not 
be  valid. 

Several  theories  based  on  widely  varying  assumptions  have  been  proposed  by : 
(i)  Heisenberg,  in  analogy  with  multiple  Bremsstrahlung, 
(ii)  Lewis,  Oppenheimer  and  Wouthuysen  by  a  generalization  of  the  Bloch- 
Nordsieck  method  in  electrodynamics, 

(iii)  Fermi,  on  purely  statistical  and  thermodynamic  considerations, 
(iv)  Peaslee  and  Takagi,  on  the  fire  ball  and  isobar  models, 
(v)  Landau  and  Pomeranchuk,  by  an  extension  of  Fermi's  theory, 
(vi)  Sudarshan  and  Srivastava,  by  a  covariant  formulation  of  the  statistical 
and  theory, 
(vii)  Bhabha. 

At  present  no  reliable  calculations  based  on  a  field  theoretic  approach  is  possible 
because  of  the  strong  interactions  involved  in  the  production  of  mesons.  We 
therefore  confine  ourselves  mainly  to  the  theories  of  Fermi  and  of  Landau  and 
Pomeranchuk,  giving  only  a  brief  description  of  some  of  the  other  theories. 

Heisenberg's  theory  of  multiple  production 

The  possibility  of  multiple  meson  production  in  a  colUsion  of  relativistic  nucleons 
was  suggested  by  Heisenberg^  soon  after  Yukawa's  formulation  of  the  meson 
theory  of  nuclear  forces.  He  extended  the  concept  of  Bremsstrahlung  in  electro- 
magnetic theory  to  the  meson  theory  as  follows.  The  sudden  deflection  of  an 
electrically  charged  particle  leads  at  the  moment  of  deflection  to  a  discrepancy 
between  the  electromagnetic  fields  surrounding  the  particle  before  and  after 
the  deflection.  The  difference  between  the  electromagnetic  fields  gets  detached 
and  wanders  away  as  radiation  (Bremsstrahlung).  The  mean  number  of  light 
quanta  driy  in  the  frequency  interval  dco  is  given  by 

dn,  =  e^^.  (1) 

This  leads  to  the  infra-red  catastrophe,  i.e.  the  emission  of  an  infinite  number 
of  very  low  energy  quanta.  In  the  case  of  the  sudden  defiection  of  a  nucleon  the 
corresponding  meson  spectrum  (assuming  it  to  be  of  the  Bremsstrahlung  form)  is 
given  by 

dn„^P--  (2) 

where  p  is  the  coupling  constant  between  the  meson  and  nucleon  and  s  the  energy 
of  the  meson  including  its  rest  mass.  While  the  infra-red  catastrophe  is  avoided 
by  the  finite  mass  of  the  meson  the  large  value  of  s  leads  to  multiplicities  greater 
than  unity  even  for  moderately  high  energies.  Also  unlike  the  electromagnetic 

^  W.  Heisenberg,  Z.  Phys.  101,  533  (1936).  See  also  R.  E.  Marstak,  Meson  Physics, 
McGran  Hill,  New  York  (1952). 
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case  the  strength  of  the  meson-nucleon  interaction  increases  with  energy.  This  has 
been  explained  by  Heisenberg  as  being  due  to  non-linear  effects  which  become 
significant  at  high  energies  causing  a  turbulent  meson  mixing  of  the  nucleon 
wave  packet.^  In  the  course  of  this  mixing  the  meson  spectrum  of  the  nucleon 
wave  packet  imdergoes  a  displacement  towards  low-frequency  leading  to  the 
creation  of  many  low  energy  mesons.  The  qualitative  consequences  of  this  model 
are: 

(i)  A  penetrating  shower  should  contain  more  low  energy  than  high  energy 
mesons. 

(ii)  The  mean  number  of  mesons  produced  should  increase  with  the  available 
nucleonic  energy. 

(iii)  The  form  of  the  meson  spectrum  should  have  very  slight  dependence  on 
the  primary  energy. 

(iv)  The  angular  distribution  of  mesons  should  be  isotropic  in  the  centre  of 
mass  system  in  analogy  with  the  isotropic  distribution  of  small  turbulences  which 
are  discarded  by  a  turbulent  fluid. 

The  production  spectrum  is  given  by 

_  ftp      _ 

n„(E,  ~d)dedD  =  k^dD  (3) 

where  6  is  the  angle  of  emission  of  the  meson  in  the  centre  of  mass  system  and  k 
and  m  are  constants. 

The  L.O.W.  Method 

The  second  major  attempt  to  develop  a  theory  of  multiple  production  was  that 
of  Lewis,  Oppenheimer  and  Wouthuysen^  who  adopted  the  Bloch-Nordsieck 
method  with  suitable  modifications.  The  assumptions  underlying  the  Bloch- 
Nordsieck  method 3  which  are  foimd  to  hold  for  the  electromagnetic  case  are: 

(a)  the  collision  time  is  small  compared  to  the  period  of  the  emitted  radiation, 

(b)  the  recoil  effect  due  to  the  emitted  radiation  on  the  source  can  be  neglected, 
and  (c)  only  those  components  of  the  field  which  satisfy  the  assumptions  (a)  and  (b) 
need  be  considered.  In  this  method  the  emitted  radiation  spectrum  is  calculated 
as  the  difference  between  the  quasi-static  fields  before  and  after  collision.  The  total 
cross-section  is  unchanged  even  though  collisions  are  now  accompanied  by  the 
emission  of  radiation. 

At  first  sight  it  appears  that  the  Bloch-Nordsieck  method  will  be  invalid  due 
to  the  increasing  strength  of  the  meson-nucleon  interaction  with  energy.  However, 


1  W.  Heisenberg,  Z.  Phys.  113,  61  (1939);  126,  569  (1949);  133,  65  (1952).  See  also 
G.Wataghin,  Acad.  Bras.  Ciene,  129  (1943). 

2  H.  W.  Lewis,  J.  R.  Oppenheimer  and  S.  A.  Wouthxtysen,  Phys.  Rev.  73,  127  (1948). 
See  also  H.  Lewis,  Proc.  7th  Rochester  Conference,  IX,  p.  1  (1957).  ♦ 

»  F.  Bloch  and  A.  Nordsieck,  Phys.  Rev.  62,  54  (1937). 


INTERACTION    OF    COSMIC    RAYS    WITH    MATTER  403 

strong  interactions  at  high  energies  do  not  necessarily  imply  a  completely  in- 
elastic collision.  If  there  is  only  a  small  energy  transfer  to  the  mesons  and  no 
large  momentum  changes  of  the  nucleons  the  assumptions  (a),  (b)  and  (c)  can  still 
hold.  The  changes  in  the  nucleonic  charges  and  spins  produce  a  discrepancy 
between  the  initial  and  final  meson  fields  present  around  the  nucleons  which 
can  be  treated  by  the  Bloch-Nordsieck  method.  Neglecting  the  final  state  inter- 
action between  the  nucleons  and  mesons,  the  probability  P^  for  emission  of  N 
mesons  in  a  nucleon-nucleon  collision  is 

The  factor  J  appears  because  the  spin  and  the  isotopic  spin  of  the  nucleons  have 
been  treated  as  classical  variables.  (This  is  a  good  approximation  if  several  mesons 
are  emitted  such  that  the  spin  and  isotopic  spin  average  out.)  J  is  of  the  order 
of  magnitude  unity  and  E  and  p  are  the  total  energy  and  momentum  of  the 
mesons  in  the  centre  of  mass  system.  If  the  mesons  are  emitted  isotropically  p 
is  zero.  The  factor  {E^  —  p^)^-^  appears  mainly  from  a  phase-space  calculation 
taking  into  account  the  pseudoscalar  nature  of  the  meson  and  a  pseudovector 
coupling  for  the  meson-nucleon  interaction.  Assuming  the  value  of  0.08  for  the 
renormalized  unrationalized  pseudovector  coupling  constant  the  ratios  for  the 
production  of  1 ,  2  and  3  mesons  for  a  centre  of  mass  energy  of  the  two-incident 
nucleons  slightly  over  1  BeV  are  1.2  :  1  :  0.45.  This  compares  well  with  the  ex- 
perimental value  of  0.8  :  1  :  0.3  though  the  theory  is  not  designed  for  such  low 
multiplicities.  At  higher  energies  the  agreement  with  experiment  is  much  nearer 
and  the  angular  distribution  of  the  mesons  is  also  well  accounted  for. 

Termi's  statistical  theory  of  multiple  meson  production  ^ 

The  basic  assumptions  of  Fermi's  theory  are :  (i)  The  two  incident  particles 
coalesce  and  the  energy  brought  in  by  them  is  released  within  a  common  region 
of  interaction.  The  volume  Q  of  this  region  is  assumed  to  be  nearly  the  volume 
occupied  by  the  meson  clouds  surrounding  the  nucleon  which  for  a  stationary 
nucleon  is  given  by  3 

Oo=^.  (5) 

Here  R  =  |/^  where  |  is  a  coefficient  of  the  order  of  unity  and  ju  the  meson 

mass.  As  the  nucleons  are  in  motion  their  pion  clouds  in  the  centre  of  mass  system 

undergo  a  Lorentz  contraction  given  by 

2  M 
Q  =  Q„-^  (6) 

where  W  is  the  total  energy  in  the  centre  of  mass  system  and  M  is  the  nucleon 
mass. 


1  E.Fermi,  Progr.  Theoret.  Phys.  5,  570  (1950);  Phys.  Rev.  81,  683  (1951);  Phys.  Rev. 
92,  452  (1953);  93,  1434  (1954). 
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(ii)  After  a  time  this  region  of  interaction  disintegrates  into  one  or  the  other  of 
the  possible  final  states  of  the  system. 

(iii)  The  interactions  involved  are  sufficiently  strong  and  the  interaction 
region  survives  for  a  sufficiently  long  period  for  all  of  the  possible  final  states 
to  have  become  equally  excited,  i.e.  a  statistical  equilibrium  is  attained.  Hence 
the  probability  for  a  given  outcome  is  proportional  to  the  number  of  states 
comprising  the  outcome. 

(iv)  The  ensemble  of  the  possible  final  states  is  to  be  selected  on  the  basis  of 
conservation  laws  which  are  expected  to  hold  during  the  interaction.  Besides  the 
conservation  of  energy,  momentum  and  charge,  the  isotopic  spin,  the  baryon 
number  and  other  similar  variables  should  be  subject  to  appropriate  constraints. 

The  transition  rate  from  an  initial  state  |i>  to  a  final  state  |/>  is  given  by 

co  =  27t\<f\H\iy\^Q(W)  (7) 

where  <,f  \H\  i>  is  the  matrix  element  of  the  Hamiltonian  connecting  the  two 
stat-es  and  q{W)  the  density  of  final  states  of  the  system  for  a  total  energy  W. 
The  final  states  are  specified  by  the  numbers  and  charges  of  the  nucleons  and  mesons 
and  perhaps  by  other  particles  which  emerge  from  the  interaction  region.  To  take 
into  account  the  assumption  of  equilibrium  among  the  various  final  states,  we 
consider  that  the  interaction  H  projects  the  state  |*>  uniformly  over  the  space 
of  all  the  allowed  states  representing  virtual  particles  confined  to  the  volume  Q. 
The  final  state  consists  of  products  of  free  particle  wave  functions  since  all  the 
particles  are  free  in  the  final  state.  If  we  normalize  these  free  particle  wave  func- 
tions in  a  large  volume  V,  the  N  particle  final  state  will  contain  a  factor  {V)~^l*^ 
but  each  of  these  N  particle  virtual  states  is  contained  within  the  interaction 
volume  Q  and  hence  will  have  a  normalizing  factor  {Qyi^^.  Thus  the  matrix 
element  is  /QVU^ 

</ 1 H I  i>  =  constant  (-|^j"    .  (8) 

The  constant,  by  the  assumption  of  statistical  equilibrium  is  independent  of  the 
final  state  and  can  be  ignored  in  the  calculations,  as  we  are  interested  only  in  the 
relative  abundances  of  the  various  final  states  for  a  given  interaction.  If  n  of 
the  N  particles  are  identical,  their  wave  functions  should  be  completely  sym- 
metric (or  antisymmetric)  in  the  exchange  of  the  coordinates  of  the  identical 
particles.  In  this  case  the  matrix  element  is  given  by 


/Q\i^  f  1  V/^ 
^1  i>  =  constant  ^—j      (^— j    . 


</ 1^1  i>  =  constant  (^—j      (^— j    .  (9) 

The  concept  of  indistinguishability  must  be  carefully  applied  in  ascribing  the 
statistical  weights.  For  instance,  if  we  treat  the  positive  and  negative  mesons 
as  different  particles,  the  possible  degeneracies  should  be  allowed  for  by  dividing 
the  matrix  element  by  [n-^ln' !],  w^  and  n-  being  the  number  of  positive  and  nega- 
tive mesons  respectively  in  the  final  state.  However,  if  we  consider  the  positive, 
negative  and  the  neutral  meson  as  the  three  states  of  a  single  particle  characterized 
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by  the  total  isotopic  spin  1,  the  matrix  element  should  be  divided  by  {n\fl\ 
n  being  the  total  number  of  mesons  present  in  the  final  state. 

The  density  of  final  states  q(W)  per  unit  energy  for  N  particles  of  total  energy  W 
will  be^ 

~\(2n)-^^  j j  n^d?Xid?p\.  (10) 

I  vw  ] 

Thus  the  relative  probability  that  n  pions  and  s  nucleons  will  emerge  from  a 
high  energy  interaction  is  given  by 

S(n,  .)  =  (|-)''^^(2jr)-3^JJnd%id'^,i  (11) 

F  W 

where  N  =  n  +  s  and  the  pions  are  assumed  to  be  distinguishable.  The  inte- 
grations should  conform  to  the  constraints  imposed  upon  the  system.  The  con- 
servation of  linear  momentum  reduces  the  independent  variables  from  N  to  N  —  1 
giving  the  relative  probability  S{n,  s)  as 

/0\N-1  A  r  rN-1 

S{n,s)  =  l^yj        S^yyi^nr'^"-"]]  nf^i^'Pi-  (12) 

VW 

The  summation  sign  indicates  that  a  suitable  sum  must  be  taken  over  the  various 
possible  charge  and  angular  momentum  states.  Neglecting  the  conservation  of 
angular  momentum  and  integrating  over  the  space  variables,  we  have 

S{n,  s)  =^  {Q)^-H27i)-^(^-^)  2J  ^Vwin,  s)  (13) 

where  F^  is  the  volume  of  the  3(N — 1)  dimensional  momentum  space  cor- 
responding to  the  N  —  1  independent  particles  which  is  calculated  such  that 
the  total  energy  of  all  the  N  particle  is  ^  IF. 
We  have  to  calculate  the  quantity 


d 


dW 


Vw=-^[n  /d^^^(i2>i)f^(^-|bf+^fr^^)      (14) 


where  the  integrals  are  taken  over  the  momentum  space  of  all  iV^  particles  with 

masses  M^.  The  delta  function  ensures  momentum  conservation  and   U{x)  is 

the  step  function 

U{x)  =1  for         :r  >0 

=  0         for         x<0. 
Expressing  the  delta  and  step  functions  in  terms  of  their  Fourier  representations 

00 

d{x)  =  (2  71)-^  J  d^Ae^^*  (15) 


1  R.  MiLBURN,  Rev.  Mod.  Phys.  27,  1  (1955). 
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and  «;*•      ^ 

U{x)  =  {2ni)-^    J     d<x^ae»«"'  (e  >  0)  (16) 

-oo-if 

and  performing  the  integration,   Lepore  and  Stuart^  deduced  an  expression 
for  the  above  quantity  in  terms  of  the  Hankel  functions, 


dV 


(17) 


where 


-oo-ie 
oo 

/A}  32  N 

"-J^=iA(^  ,18) 

z^  z    dz\      z      / 

and  ^<^^  (2)  is  given  by  the  following  relation 

00 

fdd'  cosh  6'  exp[-  i  2 (cosh  6')]  =  -tt  Z^<i2)(2) .  (19) 

-00 

The  expression  (17)  can  be  integrated  over  A  and  oc  for  the  case  of  zero  mass 
particles,  i.e.  for  the  case  in  which  all  the  particles  are  treated  as  being  extremely 
relativistic.  We  obtain  in  this  case,  on  expanding  the  Hankel  function  about 
the  origin  and  keeping  only  the  first  term 


\  dW  Ier      2*-i    [(2iV-l)!]2(3iV-4)! 


(20) 


It  is  interesting  to  compare  this  result  with  that  obtained  by  Fermi"  who 
neglected  momentum  conservation  and  calculated  the  result  assuming  the 
condition  that  the  upper  bound  of  the  total  energy  is  fixed,  i.e. 

J  Pi  SIT.  (21) 

He  obtained  os^^^lTS^'f 

(V„)f,£,  =  ^-^—.  (22) 

On  imposing  the  condition  of  momentum  conservation  by  reducing  the  number 
of  particles  by  unity  and  taking  the  energy  derivative,  we  have 


/d>VV 
\dW  I 


Fermi         (3iV-4)! 


2S(N-l)]^SN-4  (23) 


which  has  the  same  energy  dependence  as  that  obtained  above,  but  a  different 
coefficient. 


1  J.  V.  Lepore  and  R.  N.  Stuart,  UCRL  2386  Nov.  17  (1953).  Also  Phys.  Rev.  94,  1724 
(1954). 

2  E.  Fermi,  loc.  cit. 
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An  exact  calculation  can  also  be  made  for  the  case  of  non-relativistic  particles 
of  mass  M  whose  kinetic  energy  is  given  by  p^l2M.  A  calculation  similar  to  the 
above  yields  3U-i)  /^       x 

\~dW^)  ~  /o  l/f\3/2  rro/„       TwoT  •  (24) 


{sMfl^r[3{s-l)i'2] 
If  the  particles  involved  have  different  masses,  we  make  the  replacement 

sM-^2]Mi     and     M^^'l^)  ^]J[Mifi^. 


(25) 


It  is  assumed  that  the  momentum  conservation  holds  only  for  the  nucleons 
(treated  as  non-relativistic  particles).  The  produced  mesons  are  treated  as  extremely 
relativistic  (their  masses  are  accounted  for  by  taking  {p  =  ^)  for  the  total  energy 
of  the  pion)  and  they  can  assume  any  energy  consistent  with  the  overall  energy 

conservation.  In  this  case,  the  expression  for  ^  ^^  is  the  product  of  equations  (22) 
and  (23),  i.e.  we  have 

3(8-1) 


dW 


I  dWj. 


approx. 


H^'l^{2n)     2 


(5M)3/2 


[23 ''TT^] 


[Tf -siltf-7l/lp«/2)  +  3n-5/2 


3(5-1) 


+  Zn 


(26) 


Energy  conservation  among  the  increased  number  of  particles  has  been  taken 
into  account  through  the  adoption  of  a  common  energy  factor  and  through  a 
suitable  change  in  the  argument  of  the  delta  function. 

Another  method  is  based  on  a  step-by-step  calculation  which  is  done  as  follows : 
Consider  the  case  of  extreme  relativistic  particles.  We  may  denote  the  total  phase 
space  volume  for  n  such  particles  by  V(n).  Obviously  V{1)  =  0  as  only  one 
particle  is  involved.  The  momentum  volume  available  to  a  particle  of  momentum  p 
which  together  with  another  particle  of  momentum  p'  forms  a  system  having  a 
fixed  total  momentum  P  =  p  +  j>'  is  given  by 

Viw) 

Vw,p  =  27t  f  dpp^      f     d(cos0).  (27) 

0     ■  w  +  w'^W 

Here  w  and  w^  are  the  energies  of  the  particles  and  p^^  =^  p^  -\-  P^  —  2pP  cos0. 
Hence  for  fixed  W  and  P  the  boundary  of  the  region  of  integration  will  be  specified 
by  a  definite  equation  relating  p  and  cos0.  Thus,  for  F(2)  we  observe  that  the 
integration  limits  are  determined  by 

p2  ^  p2  _  2^p cos<9  =  p"^  =  w"^  =  {W-  p)^ 
or 

p  (cos  6) 

which  gives 


JiW-Pcosd) 


(28) 
(29) 


F(2)  =  27r  f  d(cosd)^[p{co8  6)f  =  ^W{W^-P^) 
J  3  b 


(30) 


408  ELEMENTARY    PARTICLES    AND    COSMIC    RAYS 

or  dF(2)      71 


dw)         2 


W^.  (31) 


which  is  the  same  result  as  given  by  the  general  formula. 

It  is  possible  to  adopt  the  above  treatment  to  the  case  of  pion-nucleon  collisions 
resulting  in  the  emission  of  n  pions  (considered  as  massless  particles)  and  a 
single  nucleon  of  mass  M  treated  relativistically.  F(n)  is  calculated  as  for  the 
n  massless  particles  detailed  above  and  the  particle  of  mass  M  is  added  along 
with  the  requirement  that  the  total  momentum  P  equals  zero.  The  momentum 
limit  is  determined  from  the  equation 

2)2  =  {fr-(3f2  +  p2)V.)2  (32) 

or 

^^  -  ^' 
^  =  -2^—  ^^^> 

Then  for  w  =  1 ,  we  have  for  the  momentum  volume 

1  p  (cos  0) 


F(lJf,  10)  =271^01(008  6)    y   p'dp  =  |-r    ^^  I 


(34) 


Since  the  formulae  for  other  cases  are  complicated  we  quote  only  the  energy 
derivatives  for  values  oin=\  and  2 

-^—  (1  M,  10)  =  ^  TF2(i  _  ^2)  (1  _  ^4)  (35) 


71^       W^      {  \  111 

(IJ/,  20)  =  ^r^^— —   (1-^2)   7-43r2_3i;«  +  2?^8+128i;Mog—  etc. 
iV2'5![  L  ^Jj 


dF  .. 71^    W^ 

dw 


(36) 


M 


where  v  =  -^  and  represents  the  relative  effect  of  the  particle  of  mass  M  over 

what  it  would  have  had  as  a  mass  zero  particle. 

These  and  other  similar  formulae  for  higher  values  of  n  permit  one  to  calculate 
the  relative  probability  for  the  production  of  various  numbers  of  mesons  in  pion- 
nucleon  coUisions.  Taking  the  centre  of  mass  energy  as  1.92  BeV  and  the  nucleon 
mass  as  0.938  BeV,  v  =  0.487.  Omitting  also  consideration  of  the  charge  states 
and  the  identity  of  particles,  the  relative  probabilities  8{n,  1)  for  the  emission 
of  n  pions  and  one  nucleon  was  obtained  by  Milburn  as 

«(1,  l)  =  -g^-2-(0-72)  (37) 
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The  total  energy  W  in  the  above  equation  is  expressed  in  momentum  units  and  the 

W        W     M  1  2n 

factor  in  W  becomes  -^r —  =  -rp  -^ —  =  — i—  where  A„  equals  -^rp  =  1.2  x  10-^^  cm, 

A* 

the  extent  of  the  meson  cloud  around  a  nucleon.  Taking  the  nuclear  volume 

4:71 

Qq  to  be  — r—  (X^Y  t^^  results  in  terms  of  the  ratio  of  the  interaction  volume  Q 
to  the  nucleon  volume  Qq  are  obtained  as 

^^(2,     1)  ^r.J^ 


''''''      0.318  (^)  («) 


^(2,1)         ■       \Q^ 
>Sf(3,  1)^(1,  1) 


[8  [2,  1)P 


0.156.  (42) 


It  is  clear  that  the  interaction  volume  plays  an  important  role  in  the  determination 
of  meson  multiplicity. 

For  the  case  of  several  particles  of  non-vanishing  mass,  a  similar  calculation 
gives  the  energy  derivative  for  one  nucleon  and  one  pion  (with  mass)  as 

^^(lJf,lm)  =  ^  (0.714)  (43) 

taking  the  pion  mass  as  0.136  BeV  and  the  nucleon  mass  as  0.938  BeV  for  a  total 
energy  1.92  BeV  in  the  centre  of  mass  system.  The  inclusion  of  the  pion  mass 
has  thus  reduced  the  one  pion  weight  as  compared  to  the  case  of  a  massless  pion, 
equation  (35),  by  a  factor  of  0.99.  Hence  the  ratio  of  the  relative  probabilities 
becomes  „ 

-S(2,  1)/S(1,  l)  =  1.54(^).  (44) 

It  was  suggested  by  Fermi  that  besides  charge  conservation  the  conservation  of 
isotopic  spin  can  be  imposed  upon  the  interacting  pion-nucleon  system.  The  number 
of  pions  and  nucleons  are  combined  to  form  a  joint  system  and  the  total  isotopic  spin 
vector  is  formed  in  the  same  way  as  the  total  angular  momentum  of  the  particles. 
The  resultant  isotopic  spin  is  the  vector  sum  of  the  isotopic  spins  of  the  component 
particles,  i.e.  T^^J'^i-  Conservation  of  charge  and  isotopic  spin  means  that 

i 

T  and  T^  are  good  quantum  numbers  during  the  interaction.  We  shall  here- 
after denote  a  state  with  total  isotopic  spin  T  and  charge  component  M{=  T^ 
by  (T,  M).  Let  us  consider  the  possibility  of  producing  one  more  pion  in  a  pion- 
nucleon  collision.  With  two  pions  and  one  nucleon,  the  state  {^j^,  — ^/g)  for  example 
can  be  formed  in  two  ways  when  the  total  isotopic  spin  T^jiOithe  two  pions  is 
2  and  1 .  In  the  spirit  of  Fermi's  statistical  theory  we  assume  that  these  two  pos- 
sibilities are  equally  excited  and  this  degeneracy  provides  the  (^/g,  — ^/g)  state  an 
overall  weight  of  2.  To  proceed  with  the  calculation  we  must  first  relate  the  various 
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two  pion  states  according  to  their  total  isotopic  spin  To„  and  the  total 
Z  component  Mo„ .  This  relation  is  given  by  tlie  set  of  Clebsch-Gordon  coefficients 
C(l,  1;  M^M^\\,  1;  T<i.„M^„)  evaluated  for  Tg,  =  2,  1,  0.  The  amplitudes  are 
given  compactly  in  an  obvious  notation 

(2,2)      =(++) 
(2,1)      =(+0) 

(2,0)      =i-(-f  _)4-|(0,0) 

(2,-l)  =  (0,-) 

(2,-2)=(--)  (45) 

(M)      ==(+0) 

(1,0)      =(4--) 

(1,-1)=  (+0) 

(0,0)      =|-(+_)  +  i(0,0). 

The  nucleon  can  be  added  to  these  two-pion  states  by  utilizing  the  coefficients 
^(^2«>  V2;  M2„My\To„^l^\  TM).  A  system  in  the  state  (V^,  — V2)  will  form  two- 
pion,  one-nucleon  states  with  the  relative  weights  for  {p )  being  ^j^  and  for 

(w,  0~)  being  ^/g.  The  sum  of  these  weights  is  just  2  representing  the  overall 
degeneracy  of  the  (^/g,  — ^1^  state.  In  practice  these  factors  will  be  nullified  by 
the  reduction  in  the  over  all  weight  arising  from  the  indistinguishability  of  the 
pions.  Similar  calculations  have  been  made  by  Fermi  for  the  products  of  nucleon- 
nucleon  colUsions. 

In  the  foregoing,  statistical  correlations  and  the  conservation  of  angular 
momentum  have  been  neglected.  The  calculations  have  been  simplified  by  Fermi  ^ 
by  a  thermodynamical  treatment  of  the  meson-nucleon  system  in  statistical 
equilibrium.  This  is  specially  convenient  at  high  energies  >(5  x  10^^  eV)  as 
relativistic  thermodynamics  can  be  applied  to  both  the  nucleons  and  the  mesons. 
Since  pions  are  bosons,  Stefan's  law  for  black  body  radiation  holds  except  for 
a  factor  of  ^/g  arising  from  the  fact  that  the  pion  carries  a  statistical  weight 
of  3  because  of  its  positive,  negative  and  neutral  charges  while  the  photon  has 
a  statistical  weight  of  2  due  to  its  two  polarization  states.  Thus  the  energy  asso- 
ciated with  the  ,T-meson  is 


10 


(46) 


where  k  is  the  Boltzmann  constant  and  T  the  temperature.  The  contribution  of 
the  nucleons  to  the  energy  density  is  given  by 

„,.1^3«^(.^,..  ,47) 


1  E.  Fermi,  loc.  cit. 
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The  factor  2  takes  account  of  the  possibility  of  creation  of  antinucleons.  The  tem- 
perature of  the  meson-niicleon  system  is  obtained  by  equating  the  total  energy 
in  the  centre  of  mass  system  to  the  product  of  the  volume  Q  and  the  sum  of  the 
two  energy  densities.  We  have 

Statistical  mechanical  arguments  also  give  for  the  number  densities  of  the  pions 
and  the  nucleons, 

n^  -  0.367  {k  Tf  :  n^  =  0.855  [k  Tf .  (49) 

It  was  shown  by  Fermi  that  the  angular  momentum  conservation  merely  reduces 
the  total  number  of  pions  and  nucleons  by  a  factor  of  0.51.  Hence  the  total 
number  of  nucleons  and  pions  are  given  respectively  by 

1/f  '^^  i=^l/-^-  («» 


0.54 


Thus  this  theory  predicts  the  production  of  comparable  numbers  of  pions 
nucleons  and  antinucleons  at  very  high  energies. 

As  mentioned  above  the  effect  of  angular  momentum  on  the  numbers  of  nucleons 
and  mesons  predicted  by  this  model  is  not  very  serious  but  its  effect  on  the  angular 
distribution  of  the  emitted  particles  is  much  more  critical.  The  effect  is  particularly 
felt  for  large  values  of  the  impact  parameter.  Since  angular  momentum  must  be 
conserved  the  outgoing  particles  must  carry  away  the  large  amount  of  angular 
momentum  associated  with  the  two  nucleons  colliding  with  a  large  impact  para- 
meter {J^hjX).  The  particles  emitted  in  the  direction  of  either  the  incident  or 
the  target  nucleon  in  the  centre  of  mass  system  will  carry  the  largest  amount 
of  angular  momentum.  Thus  forward  and  backward  emission  will  be  preferred 
and  the  angular  distribution  of  the  emitted  particles  will  not  he  isotropic  in  the  centre 
of  mass  system. 

Some  applications  of  Fermi's  theory 

(a)  Calculations  on  Fermi's  theory  applied  to7i:~-^  events  at  1.37  BeV  laboratory 
energy  for  the  tt"  is  given  below.  The  normalized  relative  probabilities  for  three 
values  of  interaction  volume  Q  (given  in  terms  of  the  proton  volume  Qq)  are  Usted 
assuming  that  not  more  than  three  pions  are  produced  in  significant  quantities. 


Relative  production  probabilities  for  1,  2  and  3  pions 
in  TT—p  collisions  at  1.37  BeV  (Lab.  system) 


Q^Qo 


Q  =  0.1SQ. 


Q  -  0.5  Qq 


8{\,  1) 
^(2,  1) 
S{^,  1) 


0.36 
0.56 
0.08 


0.42 
0.51 
0.07 


0.56 
0.42 
0.03 
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It  is  of  considerable  interest  to  list  the  distribution  of  charged  secondaries  and 
compare  with  experimental  data. 

Distribution  of  charged  secondaries  produced  in  n~  —  p  collision 


Charged 
secondaries 

Q-Qo 

D  =  0.78  Qo 

i^  =  0.5I2o 

Experiment 

{n~  +  p)  elastic 
{n~  +  p)  inelastic 

n*  +  27i-  +  p 

0.27 
0.30 
0.40 
0.03 

0.32 
0.29 
0.36 
0.03 

0.45 
0.25 
0.29 
0.01 

0.11 
0.35 
0.50 
0.04 

The  agreement  of  theory  with  experiment  is  not  very  good  since  it  predicts 
many  more  elastic  7t~-p  events  than  what  appear  to  occur.  It  becomes  worse 
when  nucleon-nucleon  collisions  are  considered.  Recent  experiments^  on  meson 
production  in  {fi-p)  and  (p-p)  collisions  at  cosmotron  energies  0.8,  1.5  and 
2.75  BeV  have  shown  a  more  frequent  double  pion  production  than  could  be 
accounted  for  by  the  simple  statistical  theory.  We  give  the  main  results  below.^ 

Multiplicity  of  pion  production  as  a  function  of  beam  energy 


Be^m 
energy 

Experimental  Ratio 
(single  :  double  :  triple) 

Fermi 
(single,  double,  triple) 

Kovacs 
(single :  double) 

0.8   BeV 
1.5   BeV 
2. 75  BeV 

100:    0:    0 
80:20:    0 
36  :  48  :  16 

100:    0:0 
94:    6:0 

78  :  20  :  2 

100:    0 
55:45 

28:72 

We  find  that  the  multiplicities  are  considerably  more  than  those  predicted  by 
Fermi's  theory  but  not  so  large  as  those  of  Kovacs^  whose  calculation  takes  into 
accoimt  the  strong  pion-nucleon  final  state  interaction.  Qualitatively  Kovacs* 
results  are  in  rough  agreement  with  experiment. 

To  explain  the  multiplicities,  Peaslee^  has  advanced  an  "isobar  model"  in  which 
either  or  both  the  colliding  nucleons  become  excited  and  finally,  after  separation, 
they  decay  into  one  or  more  pions.  The  intermediate  excited  nucleon  states  were 
considered  to  be  states  of  angular  momentum  ^/g,  isotopic  spin  ^/g  and  the  excitation 
energy  about  190  MeV,  corresponding  to  the  resonance  peak  observed  in  the  tc-N 
scattering  experiments.  Peaslee^  found  that  his  model  gives  large  double-pion 
production. 


1  M.  M.  Block  et  al.  CERN  Symposium  (1956). 

2  The  table  is  reproduced  from  the  previous  reference. 

3  J.  S.  Kovacs,  Phys.  Rev.  101,  397  (1956) 

*  D.  Peaslee,  Phys.  Rev.  94,  1085  (1954);  Phys.  Rev.  75,  1580  (1954). 
^  D.  Peaslee,  Ihid. 
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Pion-nucleon  and  nucleon-nucleon  experiments  conducted  in  the  energy  range 
5-10  BeV  will  shed  great  light  upon  the  problem  of  multiple  meson  production. 
For  pion-nucleon  case  at  such  energies,  Fermi's  theory  predicts  multiplicities  of 
sufficient  magnitudes  to  enable  one  to  determine  whether  any  particular  charge 
state  is  formed  preferentially.  It  will  also  be  of  interest  to  compare  the  excited 
states  of  nucleons  in  nucleon-nucleon  case  following  Peaslee's  approach  with 
the  excited  states  in  pion-nucleon  collisions.  Conclusions  must  await  the  acqui- 
sition of  experimental  data  of  suitable  quality  and  quantity. 

(b)  Fermi's  theory  is  quite  general  and  deals  with  the  multiple  production  of 
particles  in  a  two-particle  collision  without  any  reference  to  the  detailed  nature 
of  the  interactions  except  through  the  masses  and  generalized  parameters  like 
volume  and  temperature.  Hence  it  can  be  directly  used  even  in  processes  involving 
the  production  of  new  particles  as  for  example  n-  and  iT-mesons  resulting  from 
the  annihilation  of  nucleon-anti-nucleon  pairs.  These  multiplicities  have  been 
calculated  by  Sudarshan^  on  the  Fermi  and  Pomeranchuk-Landau  models  (which 
will  be  discussed  presently)  for  the  annihilation  of  a  nucleon  pair  at  rest.  In  this 
case,  there  is  no  Lorentz  contraction  of  the  volume  in  the  centre  of  mass  system 
and  hence  there  is  no  anisotropy. 

For  the  Fermi  model  we  consider  three  separate  cases: 
(i)  All  mesons  are  relativistic, 
(ii)  All  mesons  are  non-relativistic, 

(iii)  The  iT-mesons  are  non-relativistic  but  the  pions  are  relativistic. 
Then  we  can  immediately  write  down  the  appropriate  densities  of  states  as 

dF   _   1   /7rPf3\^-i {4.n  -  4) ! 

dTF"Tfl"l~"j        (2ri- 1)!  (2ri-2)!  (3/1-4)!  ^     ^^ 


3« 


d F        (8 7^3)   2     .  (^2^ ^nnyu    {W  -njf^niK-  n^ m„)  ^       2 


dW  [n^m,  +  n^mjl^        r(^in^  +  n„-l)) 


(51b) 


dF        m^^(27z;m^)''/^(«if-i)(87r)'^«    {W  -  nj^niK- n^m^fi^'n^h^E)-^ 

(olc) 


d  W  [UK m^yi^  n^inic  +  2n^  -  1)] 

where  n  =  n„+  rij^;  IF  =  2.0;  mji  =  0.520;  m„^  0.148.  (The  unit  of  mass  is 
the  nucleon  mass  and  all  energies  are  expressed  in  terms  of  rest  energy  of  the 
nucleon.)  The  relative  probabilities  are 

P{n„,  nj,)  =  G{0M6QlQX-^-^V{n,nj,)  (52) 


4jr/   1  \3  1 

~5~"  ,  where  — 

in  which  the  annihilation  takes  place.  It  is  to  be  noted  that  Q  does  not  involve 


where i2o=  -tt-x I  ,  where  — ^  =  1.4  x  10"^^  cm^s  the  fundamental  "volume" 

3   Vm^/ 


1  E.  C.  G.  SuDARSHAN,  Phys.  Rev.  103,  258  (1956). 
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any  Lorentz  contraction  factor  and  the  factor  (0.945  QjQq)"  "  ^  is  neglected  in  the 
calculations.  Using  these  expressions,  the  average  and  root  mean  square  multipli- 
cities of  pions  and  of  the  total  number  of  mesons  have  been  computed  by  Sudarshan.^ 
These  are  given  in  Table  I. 

TabU  I 
(a,  b,  c  correspond  to  equations  (51a,  61b,  51c)) 


Quantity 

a 

b 

c 

VP 

np 

PP 

np 

vP 

np 

■sW„> 

1.52 

1.63 

1.74 

1.84 

2.27 

2.43 

vV//« 

1.84 

1.94 

2.12 

2.23 

2.70 

3.01 

vW„  +  njf  > 

2.81 

2.90 

3.27 

3.34 

2.72 

2.84 

<(n„  +  n^)*>Vs 

2.89 

2.96 

3.32 

3.38 

2.95 

3.22 

Pk 

65 

64 

76 

75 

23 

21 

The  results  for  antineutron  annihilation,  namely  nn,  pn  are  identical  to  the 
results  for  pp  and  np  respectively. 

As  has  been  pointed  out  earlier,  the  Fermi  theory  does  not  take  into  account 
the  strong  final  state  interaction  of  the  mesons.  The  Pomeranchuk-Landau* 
theory  which  includes  the  final  state  interaction  has  been  applied  by  Sudarshan  * 
to  the  present  problem.  The  results  of  the  calculation  are  given  in  Table  II. 

Table  II 
(Corresponds  to  equation  (51  c)) 


Quantity 

Casel 

Case  2 

Case  3 

pp 

np 

pp 

n  p 

pp 

np 

<n„  +  n^y 

<{n„  +  nj,ryi^ 
Pk 

3.54 
3.64 
3.59 
3.66 
3 

3.55 
3.64 
3.59 
3.65 
2 

2.94 
3.12 
3.12 
3.21 
9 

3.03 
3.18 
3.18 
3.25 

8 

2.74 
2.98 
3.10 
3.18 
18 

2.84 
3.04 
3.16 
3.22 
16 

Comparison  of  Tables  I  and  II  shows  that  the  pion  multiplicity  of  Pomeranchuk- 
Landau  model  is  nearer  to  the  observed  one  (which  is  3.4)  than  Fermi's. 

The  statistical  models  predict  a  smaller  value  (2.2-3.4)  for  pion  multiplicity 
and  a  larger  number  of  events  involving  iT-emission  than  the  measured  values. 
Koba  and  Takeda*  suggest  that  some  characteristics  of  nucleon  (anti-nucleon) 


^  E.  C.  G.  Sudarshan,  loc.  cit. 

2  I.  Ya.  Pomeranchuk  and  L.  Landau.  Dokl.  Akad.  Nauk.  8SSR.  78,  889  (1951). 

^  E.  C.  G.  Sudarshan,  loc.  cit. 

*  Z.  Koba  and  G.  Tared  a  (Preprint). 
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structure  are  being  reflected  in  the  transition  matrix  elements  leading  to  the  large 
pion  multiplicity.  They  divide  the  pions  resulting  out  of  annihilation  into  two 
classes : 

(i)  those  coming  from  the  pion  cloud  and  (ii)  those  produced  by  the  violent 
annihilation  of  the  core-anticore  pair.  The  essential  part  of  the  core  annihilation 

will  end  within  a  short  period  ^^-^ which  is  smaller  than  the  period  in  which 

a  pion  in  the  cloud  is  absorbed  and  re-emitted  by  the  core  nucleon  which  is  of  the 

order .  Therefore  during  the  short  time  in  which  the  core  annihilation  is  com- 

w„ 

pleted,  the  number  of  pions  in  the  cloud  will  be  constant  and  these  having  lost 
their  centres  will  then  be  emitted,  the  number  of  pions  being  2  or  3  on  the  average. 
On  the  other  hand  during  the  violent  annihilation  process,  three  or  more  pions 
will  be  produced.  Koba  and  Takeda  calculate  the  pions  emitted  from  the  cloud 
on  the  basis  of  the  static  theory  and  apply  the  statistical  theory  to  the  core 
annihilation.  The  mean  number  of  pions  emitted  from  the  cloud  turns  out  to  be 
1.96.  If  this  is  added  to  the  usual  number  given  by  the  statistical  theory  the  agree- 
ment with  experiment  is  very  good. 

Extension  of  Fermi's  theory  by  Landau  and  Pomeranchuk^ 

Quantitative  calculations  on  the  basis  of  Fermi's  theory  do  not  agree  very  well 
with  experimental  results,  especially  those  relating  to  angular  and  energy  distri- 
butions. Hence  the  following  improvement  of  Fermi's  theory  was  suggested  by 
Landau.^ 

At  the  moment  of  collision  there  are  produced  a  number  of  particles  in  a  volume, 
the  linear  dimensions  of  which  are  determined  by  the  nature  of  the  nuclear  forces 
and  the  energy  of  the  incident  particles.  It  must  be  emphasized  that  the  concept 
of  the  number  of  particles  has  a  meaning  only  in  a  limited  sense  since  the  density 
of  mesons  and  nucleons  is  very  large  in  the  nucleus.  The  mean  free  path  of  a 
particle  in  such  a  system  is  obviously  small  compared  to  its  dimensions.  This 
system  will  expand  with  time  and  according  to  Landau  the  expansion  process  is 
hydrodynamic  in  character  as  the  smallness  of  the  mean  free  path  allows  us  to 
view  the  motion  as  microscopic,  i.e.  as  the  motion  of  an  ideal  non- viscous,  non- 
thermal conductive  fluid.  Since  the  velocities  of  the  particles  are  comparable 
with  that  of  light,  the  equations  are  those  of  relativistic  hydrodynamics. 

The  total  number  of  particles  does  not  remain  constant  during  the  hydro- 
dynamical  stage  of  expansion.  The  number  of  particles  emerging  after  the  col- 
lision is  determined  not  by  those  produced  at  the  moment  of  collision  as  was  the 
case  in  Fermi's  theory  but  at  the  end  of  the  expansion  stage  when  they  fly  apart 
as  free  particles.  This  feature  was  suggested  by  Pomeranchuk  and  Landau's 


1  1.  Ya.  Pomeranchuk,  DoU.  Akad.  Nauk,  888R.  78,  889  (1951);  L.  D.  Landau,  Izv. 
Akad.  Nauk,  USSR.  17,  57  (1953). 

2  L.  D.  Landau,  Ibid. 
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theory  which  essentially  consists  in  studying  the  expansion  stage  in  a  quanti- 
tative form. 

The  end  of  the  expansion  stage  comes  when  the  mean  free  path  of  the  particles 
becomes  equal  to  the  linear  dimensions  of  the  expanding  volume.  The  order  of 
magnitude  of  the  temperature  of  the  system  at  this  point  is  given  by 

where  fi  is  the  mass  of  the  meson  and  the  temperature  is  measured  in  energy 
units.  It  is  to  be  noted  that  the  above  relation  is  independent  of  the  nature  of 
the  system,  i.e.  the  energy  of  the  colliding  particles.  When  the  temperature  is 
lower  than  yu,  the  equilibrium  number  density  falls  off  exponentially  e'f^l^  as  the 
system  * 'cools'*.  So  the  mean  free  path  rapidly  approaches  the  linear  dimensions 
of  the  expanding  system.  The  formula  for  T  is  valid  even  when  heavy  mesons 
are  generated. 

We  assume  that  the  equation  of  state  for  the  compressed  matter  at  a  temperature 
T  »  /i  is 

p  =  e/3  (53) 

where  p  and  e  are  the  pressure  and  the  energy  density.  Since  the  number  of 
particles  is  not  given  but  is  defined  by  the  condition  of  the  statistical  equilibrium, 
the  chemical  potential  is  given  by 

C=  e  —  Ts  +  p  =  0  (54) 

where  s  =  entropy  per  unit  volume.  Thus  we  find 

^5  =  -3-£.  (55) 

Making  use  of  the  relation  d  £  =  Tds  for  a  given  volume,  we  obtain  . 

soceI*;  T^eI*.  (56) 

It  is  clear  that  the  entropy  of  the  system  in  the  various  parts  will  remain  constant 
during  the  whole  of  the  hydrodynamical  expansion,  if  we  assume  that  the  motion 
of  the  ideal  fluid  is  performed  adiabatically.  Dividing  the  entire  system  into 
microscopically  small  homogeneous  portions,  let  s^  be  the  entropy  of  one  of  these 
parts  and  n^  the  number  of  particles  in  the  octh  portion  produced  at  the  moment 
of  its  "fljdngaway".  These  times  may  not  coincide  for  different  portions.  The 
ratio  sjn^  depends  only  weakly  on  the  temperature  so  that  we  can  put 

n^  =  constant  x  s^  (57) 

where  the  constant  is  some  universal  constant  and  is  found  to  be  of  the  order 
of /i^.  Thus  we  find  for  all  the  portions 

N  =  constant  x  S  (58) 

where  N  is  the  total  number  of  particles  and  S  is  the  total  entropy  of  the  system. 
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Consider  now  the  "head-on"  collisions  between  two  protons.  Let  E'  be  the  energy 
of  each  proton  in  the  centre  of  mass  system  of  the  two  particles.  If  V  is  the  volume 
in  which  the  energy  is  distributed,  the  total  entropy  of  the  system  S  is  proportional 

to  €*'*  V.  Since  g  =  -=zr  we  have  the  result  that  entropy  as  well  as  the  number 

of  particles  is  proportional  to  j^'"/*  V^l*.  To  calculate  the  volume  F,  we  note  that 
the  transverse  dimension  of  the  system  is  of  the  order  of  the  range  of  the  nuclear 

forces,  i.e.  —  and  the  longitudinal  dimension  in  the  centre  of  mass  system  suffers  a 

Lorentz  contraction  in  the  ratio  MjE'  and  thus  the  system  looks  like  a  strongly 
flattened  disk  with  its  volume  given  by 

V^a^MjE'.  (59) 

Thus  N  is  proportional  to  ^E'  or  in  the  laboratory  system  where  one  of  the  protons 
is  at  rest  and  the  energy  is  E,  we  have 

NocE'l*.  (60) 

This  formula  agrees  with  that  obtained  by  Fermi.  From  dimensional  consideration, 

/    E   VI* 
we  can  express  iV  as  iV^  =  K^  j      where  K  is  a  constant  of  order  of  unity. 

Pursuing  the  above  arguments,  we  can  consider  the  collision  of  two  identical 
nuclei  of  mass  A .  The  number  of  particles  produced  is 

/    E    Vl* 

We  find  from  this  expression  that  heavy  nuclei  are  more  effective  than  protons  in 
producing  the  particles.  K  is  found  to  be  of  the  order  of  2  on  a  comparison  with 
experimental  data. 

The  angular  and  energy  distribution  of  the  particles  produced,  can  be  obtained 
only  by  a  detailed  study  of  the  hydrodynamical  motion  of  the  matter  in  the  system. 
It  has  already  been  remarked  that  relativistic  hydrodynamics  should  be  employed 
since  the  matter  moves  with  velocity  nearly  that  of  Ught.  The  relativistic  hydro- 
dynamical  equations  are  expressed  by  the  relations 

■jgr-o  (62) 

where  T^^  is  the  energy- momentum  tensor  of  matter  and  is  given  for  an  ideal 
fluid  by 

rpik  _  ^gik  +  (e  +  p)u'u^  (63) 

where  u  is  the  four  velocity  and  g^^  =  g^^  =  g^^  =  1  and  gr^o  —  —  i . 

In  the  centre  of  mass  system  at  the  moment  of  collision,  the  system  has  the 
form  of  a  very  strongly  flattened  disk.  This  form  is  preserved  during  the  hydro- 
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dynamical  expansion.  The  motion  of  matter  is  regarded  as  one -dimensional  along 
the  thickness  of  the  disk  (ar-axis).  Then  the  equations  of  motion  become 

w*^*^*re  ^00  _  ^^^0)a  ^  p(^i)2 .         T^i  =  {e  +  p)  u^  u\  (65) 

rpu  ,,  g  (^1)2  ^  p  (^0)2 .  (^0)2  _  (^1)2  _  ^  (gg) 

The  ''disk"  expands  on  both  sides  in  the  centre  of  mass  system  and  hence  we  choose 
the  origin  of  the  coordinate  system  on  the  middle  plane  and  observe  the  motion  in 
the  positive  direction  of  the  a:-axis,  i.e.  a;  >  0,  t^^  >  0.  Denoting  the  initial  thick- 
ness by  zJ ,  we  consider  the  case  when  due  to  expansion  the  thickness  t  :$>  A.  The 
matter  will  occupy  a  space  0  <  a;  <  f.  Most  of  the  matter  moves  with  a  velocity 
of  the  order  of  light.  In  the  narrow  layer  t  —  x  <^  t,  the  matter  is  moving  with 
velocity  nearly  that  of  light  and  it  is  in  this  region  that  most  of  the  energy  is 
concentrated.  Introducing  a  new  variable  S  =  t  —  x  and  in  the  extreme  relativistic 
case  we  have  w°  :^  li^  =  m  »  1  and  u^  —  w^  <^  ^U'^-  The  equations  (62)  then  take 
the  form 

We  shall  look  for  solutions  of  these  equations  in  the  domain  of  values  /  >  <^  >  Zl 
and  we  assume  that  u{S,  t)  shall  be  of  the  form  u^  =  ftjS  where  /  is  a  slowly 
varjdng  function  of  t  and  S.  Finally  we  obtain 

£=  SqQ^    ^  ^  (69) 

w^here 

T=:log«/zl;  r;  =  log^/Zl. 


'-m, 


To  discuss  the  energy  and  entropy  distribution  along  the  thickness  of  the  "disk'* 
we  first  note  that  the  energy  density  is  given  by  T^  ^  eu^.  Putting  u^  ^  tjS 
and  using  the  expression  (69)  we  have  for  energy  in  a  layer  diS 

dE  a  exp[-V3(l/T  -  2]/rj)^]dr}.  (71) 

From  this  it  is  seen  that  the  energy  distribution  is  maximum  at  ?y  =  t/4  and  the 
energy  is  mainly  concentrated  in  the  domain  <^  ^  4  y{tA^).  As  s  ol  e'*  we  obtain  for 
the  entropy  in  a  layer  dS 

d5aexp[— J  (j/T  — l/?y)2]d7y.  (72) 

This  has  a  maximum  at  ?y  =  t  and  unlike  the  energy,  this  is  concentrated  in  the 
region  S  r^t. 
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These  solutions  will  be  valid  only  so  long  as  the  angle  of  flying-away  6,  i.e. 
the  angle  formed  by  the  trajectory  of  the  given  matter  element  with  the  cc-axis  is 
very  small.  Further  if  we  introduce  new  variables  A  and  L  defined  by  ^/a  =  e~^; 

—  =  e~^,  the  entropy  distribution  becomes 

dsoceP^'-^'dX  (73) 

and  the  distribution  of  the  particles  produced  is  given  by 

dN=Cef^'-^'d?.  (74) 

where  (7  is  a  normalization  constant.  The  angle  of  flying-away  is 

^ =e-^  (75) 

and  is  a  constant  with  S  for  each  element  and  so  for  each  particle.  These  expres- 
sions determine  the  angular  distribution  of  the  generated  particles  (in  the  centre 
of  mass  system)  with  a  single  parameter  A.  The  constant  L  is  related  to  the  energy 
of  the  colliding  particles  and  is  found  to  be 


'H-Ar) 


(76) 


where  M^  is  the  mass  of  particles.  The  angular  distribution  given  by  the  expression 

dN 
(75)  does  not  give  spherical  symmetry  as  assumed  by  Fermi  but       „    per  unit 

angle  rapidly  increases  with  decreasing  0.  The  expression  (74)  can  be  rewritten 
for  practical  purposes  in  the  form 

diV^e-^V2/.dA.  '  (77) 

The  angular  distribution  can  be  put  in  the  form  of  a  Gaussian  distribution  if  we 

tan0 
define  A  to  be  —  log — - — .  The  actual  distribution  curve  of  the  particles  in  the 

angle  0  must  possess  comparatively  long  "tails"  on  both  sides  of  the  maximum. 
Landau^  shows  that  the  energy  distribution  of  particles  in  the  centre  of  mass 
system  is  given  by 

/^i*i^Jfexp|-^  +  A  +  ^|/i:2_A4.  (78) 

Finally  we  have  to  carry  out  the  transition  from  the  centre  of  mass  system 
to  the  laboratory  system  in  which  one  of  the  nucleons  is  at  rest.  If  y^  is  the  angle 
of  fljdng  away  in  the  laboratory  system  Landau  shows  that  y^^^'^'^.  X  can 
take  both  positive  and  negative  values  to  account  for  the  particles  flying  in  the 

^  L.  D.  Landau,  loc.  cit. 

27* 
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positive  and  negative  directions  of  x  axis  in  the  centre  of  mass  system.  Similarly 
the  energy  distribution  in  the  laboratory  system  becomes 


fiu^-^Mexp  1^  +  ^  +  I  )/L«-  A«} 


(79) 


with  X  having  both  signs.  The  coefficients  can  now  be  determined  making  use  of 

the  relations  .  . 

fdN  =  N-         jufi&N  =  E  (80) 

and  we  obtain  t^    mk     tut          r  k  r  i  ^ 

From  this  expression  it  is  seen  that  the  majority  of  particles  have  energy  of  the 
order  of  7 

^    ^^^  (82) 


-(^)' 


in  the  laboratory  system.  The  figure  below  shows  the  results  using  the  above 
expression. 
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Fig.  61.  Differential  energy  spectrum  of  secondary  particles  generated  in  nuclear  inter- 
actions at  high  energies  (for  various  energies  of  the  original  particles) .  Areas  under  the 
curves  are  proportional  to  the  total  number  of  secondary  particles  (mesons  and  nucleons) . 


A  covariant  formulation  of  the  statistical  theory 

We  shall  now  describe  a  fully  covariant  formulation  of  the  statistical  theory 
of  meson  production  given  by  Srivastava  and  Sudarshan.^  The  transition  prob- 
ability from  an  initial  state  consisting  of  two  nucleons  of  four-momenta  p[  and  ^2 


1  G.  P.  Srivastava  and  E.  C.  G.  Sudarshan,  Phya.  Rev.  110,  765  (1958). 
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to  a  final  state  involving  two  nucleons  of  four-momenta  p^  and  p^  and  pions  of 
four-momenta  q^,  q^,  -  ■  •  <lm^^  given  by 

Wm  =  f^*Pi  d'^2  ^'qi  '•'d^qdHPi  +  P2  +  qi+---  qm  -  K  -  P2) 

d{pl  -  m^)  d{pl  -  m2)  . . .  d{ql  -  f^^)  . . .  d{q%-  f,^)  f{pp'  q)  (83) 

where  f{pp'q)  is  an  invariant  function  of  the  nucleon  and  pion  four-momenta 
which  is  obtained  by  averaging  over  the  initial  and  summing  over  the  final  spins 
of  the  absolute  square  of  the  matrix  element.  In  the  statistical  theory  we  replace 
/(pp'g)  by  a  constant  quantity  independent  of  the  four-momenta  but  depending 
on  the  number  of  meson  and  nucleon  lines  in  the  diagram  corresponding  to  the 
process.  From  dimensional  arguments  we  have 

t{pp'q)  =  AS{m)K-^'^  (84) 

where  iT  is  a  parameter  of  the  dimensions  of  mass  and  A  a  numerical  constant. 
The  factor  8(m)  takes  account  of  the  constraints  on  the  final  state  due  to  the 
conservation  laws.  The  Compton  wavelength  corresponding  to  the  mass  K 
takes  the  place  of  the  linear  dimensions  of  the  Fermi  volume.  K  can  be  chosen 
so  as  to  fit  the  experimental  data  on  a  nucleon-nucleon  process  at  any  one  energy. 
The  way  in  which  K  enters  the  theory  shows  clearly  that  it  is  a  co variant  constant. 
The  table  below  gives  a  comparison  with  experiment  of  the  meson  multiplicities 
in  proton-proton  collisions  calculated  on  the  basis  of  this  theory. 


Ratio  of 

single:  double:  triple 

proton  energy  ^ 

1.5  BeV 

Ratio  of 

single:  double:  triple 

proton  energy  = 

2.75  BeV 

Present  theory  K  ^  =  1.5  x  10  ^'  cm 
Z-i  =  2.0  X  10-"  cm 

77  :  28  :  3 
77  :  50  :  1 

39  :  29  :  5 
39  :  51  :  17 

Experiment 

80  :  20  :  0 

36  :  48  :  16 

Bhabha's  theory^ 

Bhabha's  theory  of  the  production  of  mesons  and  nucleons  in  very  energetic 
nucleon-nucleon  collisions  is  based  on  the  distribution  of  the  observed  rest 
mass  of  the  nucleon  into  the  proper  mass  and  the  field  mass.  It  is  assumed 
that  a  fraction  e  of  the  observed  mass  M  oi  o,  nucleon  is  distributed  in  some 
arbitrary  manner  over  a  volume  of  radius  of  the  order  of  the  Compton  wavelength 
of  the  meson,  and  the  rest  is  concentrated  in  a  sphere  of  radius  ^,  where  t,  is  of  the 
order  of  the  Compton  wavelength  of  the  nucleon.  The  possibility  of  several  dif- 
ferent types  of  mesons,  viz.  n  and  K  may  be  taken  into  account  by  representing 
the  distribution  of  the  mass  of  the  field  not  by  a  simple  exponential  but  by  a 
superposition  of  exponentials.  Then  it  is  possible   to  calculate  the  amount  of 


H.  J.  Bhabha,  Proc.  Roy.  Soc,  A  219,  293  (1953). 
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energy  available  in  each  region  and  ultimately  in  the  whole  volume  for  shower 
production,  from  general  considerations  using  the  principle  of  conservation  of 
energy  and  momentum.  The  shower  production  takes  place  in  three  distinct 
regions  and  the  total  energ^'^  available  may  be  represented  as  Ef  =  2Eq  where  E* 
corresponds  to  the  region  in  which  the  meson  fields  of  the  two  nucleons  overlap 
and  2i?o  corresponds  to  the  other  two  regions  which  contain  the  spheres  of  radius  C, 
each  of  which  surrounds  one  of  the  initial  nucleons.  It  has  been  pointed  out  by 
Bhabha  that  the  theories  of  Fermi  and  Heisenberg  are  based  on  the  tacit  assump- 
tion that  the  entire  mass  of  the  nucleon  is  located  in  the  field  and  distributed  over 
the  volume  occupied  by  the  field.  In  the  present  theory,  the  total  energy  that  goes 
into  shower  production  in  any  particular  encounter  is  scaled  down  by  a  further 
factor  G  in  addition  to  the  exponential  factor  introduced  by  Heisenberg,  except 
in  rare  head  on  collisions.  Thus,  the  partition  of  the  rest  mass  of  a  nucleon  between 
its  proper  mass  and  field  mass  becomes  a  matter  of  practical  interest  susceptible 
of  experimental  exploration. 

One  can  go  further.  In  Fermi's  theory  the  size  of  a  shower  is  practically  a  unique 
function  of  the  initial  energy  of  the  colliding  particles.  The  impact  parameter  of 
the  collision  only  alters  the  number  of  particles  by  a  factor  of  two  or  so.  In  the 
Heisenberg  theory,  the  size  of  a  shower  is  not  a  unique  function  of  energy  and  for 
a  given  initial  energy  the  size  may  vary  frorti  a  shower  of  one  or  two  particles  to 
that  of  the  maximum  size  possible  for  the  given  energy,  depending  upon  the  im- 
pact parameter.  The  present  analysis  permits  us  to  calculate  the  size-frequency 
distribution  of  showers  produced  by  nucleons  of  the  same  energy,  if  the  spatial 
distribution  of  the  field  energy  or  mass  of  a  nucleon  is  known,  or  conversely,  to 
calculate  in  principle  the  spatial  distribution  of  the  field  energy  of  a  nucleon  from 
the  frequency-size  distribution  of  meson  showers.  Thus  the  study  of  shower 
production  in  high-energy  encounters  becomes  a  method  of  obtaining  experi- 
mental information  on  the  localization  of  field  energy  of  a  nucleon  and  on  the 
fundamental  problem  of  the  self-energy  of  nucleons. 

Isobar  and  fireball  models^ 

The  angular  distribution  of  secondary  particles  with  very  sharp  peaks  in  the 
forward  and  backward  directions  and  the  hypothesis  of  the  possible  existence 
of  a  metastable  excited  state  of  the  nucleon,  the  isobar,  predicted  by  the  early 
strong  coupling  meson  theories,  further  strengthened  by  the  resonance  in  the 
(^/2'  ^U)  state  in  the  (low-energy)  pion-nucleon  interactions,  have  led  to  an  isobar 
model  for  the  multiple  production  of  mesons.  The  other  proposed  model  according 
to  which  the  collision  of  two  energetic  nucleons  gives  rise  to  two  ''fireballs"  each 
of  which  separates  from  the  nucleons  violently  and  after  a  very  short  time 
disintegrates  into  a  number  of  mesons  has  in  common  with  the  isobar  model  the 
assumption  of  a  hypothetical  metastable  state  in  the  intermediate  stage. 

1  Z.  KoBAand  S.  Takagi,  Progr.  Theoret.  Phya.  7,  123  (1952);  W.  L.  Kraushaar  and 
L.  J.  Marks,  Phys.  Rev.  93,  326  (1954);  G.  Cocconi,  Phys.  Rev.  Ill,  1699  (1958). 
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When  two  nucleons  of  momentum  p^  and  p^  collide  with  each  other,  both  mo- 
mentum and  energy  have  to  be  conserved.  In  the  centre  of  mass  system,  the  total 
momentum  is  zero  before  and  after  the  collision.  But  the  magnitude  of  the  in- 
dividual momenta  may  be  less  than  the  magnitude  before  collision.  The  energy 
associated  with  the  motion  and  the  mass  (as  obtained  from  the  energy-momentum 
relation)  will  also  be  less  than  that  before  collision.  This  loss  in  energy  goes  into  the 
making  of  the  metastable  state  of  a  particle  with  a  fictitious  mass,  M*  greater 
than  the  nucleon  mass.  The  two  "isobars"  now  fly  apart  and  decay  independently 
into  secondary  particles.  The  decay  process  can  be  treated  by  a  statistical  method 
though  many  of  the  characteristic  features  of  the  model  do  not  depend  on  this 
approximation.  We  can  assume  for  the  interaction  volume  of  Fermi's  theory, 

a  sphere  of  radius  —  in  the  centre  of  mass  of  the  isobars  which  results  in  a  different 

multiplicity-energy  relation  from  Fermi's  model.  When  the  inelasticity  is  assumed 
to  be  much  smaller  than  one,  the  multiplicity  depends  but  little  on  the  energy  and 
mainly  on  the  inelasticity  since  if  the  latter  is  kept  constant,  the  increase  of  the 
incident  energy  is  used  up  for  the  increase  of  the  translational  motion  of  the  iso- 
bars and  not  to  increase  the  "thermal"  energy  of  the  constituent  particles. 
A  consequence  of  this  motion  of  the  isobars  is  an  angular  distribution  with  rather 
sharp  peaks,  both  forward  and  backward.  For  inelasticity  near  unity,  this  aniso- 
tropy  is  less  pronounced  in  conformity  with  experiment.  If  further  we  take  into 
account  angular  momentum  conservation,  the  isobars  can  have  high  spins  and 
the  decay  products  might  have  a  tendency  of  being  coplanar  in  the  plane  deter- 
mined by  (p  X  p*)  where  p  is  the  incident  direction  and  p*  the  direction  of  emis- 
sion of  the  isobar.  Thus  the  detection  of  an  azimuthal  asjmimetry  in  individual 
jets,  which  cannot  be  accounted  for  as  due  to  statistical  fluctuations,  will  lend 
support  to  the  isobar  and  fireball  models. 

The  isobar  model  puts  too  stringent  a  restriction  on  the  upper  limit  of  multi- 

plicity  N  ^  ^j where  rj  is  much  smaller  than  unity. 


CHAPTER  XIII 

CASCADE  PROCESSES 

1.  INTRODUCTION 

Till  now  we  have  considered  the  various  types  of  interactions  possible  between 
a  fast  incident  particle  of  cosmic  radiation  and  the  individual  nuclei  of  matter 
it  traverses.  We  shall  now  consider  the  cumulative  effects  of  such  interactions 
when  a  single  particle  enters  a  finite  thickness  of  matter.  This  particle  collides 
with  air  nuclei  in  the  atmosphere  and  in  each  collision  a  number  of  other  particles 
are  produced.  These  secondary  particles  in  their  passage  through  matter  collide 
with  other  nuclei  and  produce  further  particles.  Thus  a  cascade  is  formed  and  one 
of  the  main  problems  in  cosmic  radiation  is  to  study  the  nature  of  this  multi- 
plication. In  any  typical  coUision,  particles  of  different  types  at  various  energies 
and  angles  of  emergence  are  produced  and  the  probabilities  of  the  individual 
collisions  are  determined  by  the  corresponding  quantum  mechanical  cross- 
sections.  The  successive  collisions  are  considered  independent  and  cascade  multi- 
plication is  essentially  a  statistical  phenomenon  or  in  more  precise  terms  a 
stochastic  process  which  can  be  studied  by  methods  of  classical  probability. 
A  comprehensive  knowledge  of  the  cross-sections  for  various  processes  may  in 
principle  be  sufficient  for  developing  the  theory  of  such  cascades.  But  this  is 
obviously  a  difficult  task,  too  comprehensive  to  serve  any  purpose.  It  will  be 
convenient  to  study  cascade  development  with  emphasis  on  one  of  the  two 
aspects  given  below. 

Longitudinal  deyelopment  of  showers 

We  assume  that  all  the  particles  produced  move  in  the  same  direction  as  the 
incident  particle.  We  ignore  the  lateral  spread  of  the  particles  and  concern  our- 
selves only  with  diiferent  types  and  energies  as  this  makes  a  stochastic  analysis 
possible.  We  can  deal  not  merely  with  the  mean  values  of  the  various  quantities 
involved  but  also  their  probability  distributions  or  at  least  the  fluctuations 
about  the  mean  number. 

Lateral  spread  of  showers 

We  develop  a  cascade  theory  including  lateral  spread  but  confine  ourselves 
only  to  the  mean  values  of  the  various  quantities. 

To  simplify  the  treatment  and  to  get  a  better  insight  into  the  physical  processes 
involved  it  is  found  more  convenient  to  classify  showers  according  to  the  types 

424 
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of  particles  involved  like  soft  cascades,  nucleon  cascades  and  meson  cascades. 
This  is  only  a  matter  of  convenience  and  in  many  high  energy  interactions  all 
these  types  are  involved. 

The  literature  on  cascade  theory  has  grown  out  of  proportion  ^  to  the  use  it 
can  be  put  to  in  explaining  the  increasing  mass  of  experimental  data.  However 
in  view  of  its  fundamental  importance  in  the  mathematical  theory  of  stochastic 
processes,  it  is  considered  worthwhile  to  discuss  cascade  theory  in  detail  with 
particular  emphasis  on  the  stochastic  aspects  and  the  new  methods  stimulated 
by  such  a  study. ^  It  is  now  possible  to  present  the  entire  cascade  theory  in  a 
concise  and  compact  form  bringing  out  the  relation  between  the  various  methods 
developed. 

2.  ONE-DIMENSIONAL  CASCADES 

General  formulation 

We  shall  first  formulate  in  a  general  manner  a  cascade  process  involving 
different  types  of  particles.  Let  t  be  the  one  dimensional  parameter  representing 
the  thickness  of  matter  traversed.  We  shall  assume  that  at  ^  =  0  we  have  a 
particle  of  type  i  with  energy  Eq  .  This  particle  can  be  called  the  primary  of  the 
cascade.  Our  object  is  to  give  a  statistical  description  of  the  numbers  of  particles 
and  their  energies  at  a  thickness  t  if  we  assume  that  a  particle  of  type  i  and  energy 
Eq  is  replaced  in  a  collision  by  an  aggregate  <^|>  which  consists  of  k^  particles 
of  type  1,  k^  particles  of  type  2,  .  .  .  k^  particles  of  type  n.  The  k^  particles  of  tjrpe  / 
are  distributed  in  energy  as  follows.  One  of  them  lies  in  the  range  between  Ei 
and  E{  +  dEi,  the  second  between  E^^  and  E^  +  dE{,  .  .  .  the  kj^^  between  E{^ 
and  El.  +  dE{..  The  probability  of  its  production  in  dHs 

B^'^dE^ylEojdt.  (1) 

This  cross-section  is  defined  by  quantum  mechanical  laws  and  represents  the 
probability  of  a  collision  of  the  particle  of  type  i  with  a  nucleus  in  its  passage 
through  matter  of  thickness  d^,  the  collision  being  electromagnetic  or  nuclear. 
The  total  cross-section  i2*  (Eq)  is  obtained  by  integrating  over  all  energy  variables 

and  summing  over  all  values  of  kj  and  j  and  multipljdng  by  ^,   ,  .  This  latter 

i 

1  For  an  introduction  to  the  mathematical  aspects,  see  (a)  M.  S.  Bartlett,  An  Intro- 
duction to  Stochastic  Processes,  Cambridge  University  Press  (1955).  For  a  comprehensive 
summary  of  the  works  of  various  authors  on  cascade  processes,  see  Bharucha-Reid,  Markov 
Processes  and  Their  Applications,  McGraw-Hill  (1960).  The  mathematical  methods  used  therein 
and  the  relations  between  them  are  discussed  in  detail  by  Alladi  Ramakrishkan  in 
"  Probability  and  Stochastic  Processes",  in  Handhuch  der  Physik,  Vol.  3/2,  578  (1958). 

2  See  for  the  summary  of  the  methods,  Alladi  Ramakrishnan  and  P.  M.  Mathews, 
Progr.  Theoret.  Phys.  11,  95  (1954). 

.  Messel  and  his  collaborators  have  made  extensive  calculations  on  this  subject  by  carrying 
through  systematically  the  analytic  solution  of  complicated  equations  which  can  be  formally 
written  by  the  use  of  the  new  methods. 
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factor  arises  since  integration  with  respect  to  each  of  the  energy  variables  over  the 
whole  range  implies  counting  an  event  representing  the  occurrence  of  kj  particles 
k^l  times. 

We  have  now  to  define  a  function  which  gives  a  detailed  description  of  the 
cascade  at  thickness  t.  The  most  comprehensive  one  is  obviously  that  which 
defines  the  probability 

7t«E\y\E,,t)d<E\y  (2) 

that  particles  of  the  aggregate  <^j>  lie  in  the  corresponding  energy  ranges  and 
none  elsewhere.  This  function  is  obviously  too  difficult  to  handle.  We  should  be 
normally  content  with  the  function 

nH<Vm>\Eo;t)  (3) 

the  probability  that  we  have  r„»  particles  of  type  m  (m  =  1 ,  .  .  . ,  w)  with  energy 
greater  than  or  equal  to  J^,„  at  t  given  that  at  ^  =  0  we  had  only  one  particle  of 
t\^  i  and  energy  Eq.  The  standard  method  for  the  study  of  the  development 
of  the  cascade  is  to  examine  the  variation  of  this  function  with  t.  This  can  be  done 
only  if  we  know  the  individual  energies  of  the  particles  at  t  as  in  the  case  of 
the  comprehensive  J  function  to  be  defined  later  but  it  is  not  possible  in  the 
case  of  the  tt  function.  However  we  have  chosen  the  tt  function  as  sufficient 
for  our  purpose  and  it  is  only  recently  that  methods  have  been  developed  to  deal 
with  it  directly.  This  method  known  as  the  regeneration  point  method  was  first 
introduced  by  Janossy^  in  his  treatment  of  electron-photon  cascades.  We  shall 
here  present  the  corresponding  treatment  in  the  general  case. 

The  joint  probability  that  nothing  happens  to  the  initial  particle  of  type  i  and 
energy  E^  tiQ  t  and  that  the  aggregate  iEJy  is  produced  in  dr  is 

4-R^*HE,)r]  R(i){^Eiy  Eo)  dr.  (4) 

Somewhere  between  0  and  t  in  the  interval  (t,  t  +  dr)  the  initial  particle  of  type  i 
produces  an  aggregate  and  the  members  of  this  aggregate  become  in  turn  primaries 
of  cascades.  This  aggregate  can  be  formed  anywhere  between  0  and  t  and  therefore 
we  obtain  the  integral  equation 

0    <Ei>  j 

y<i;U^«v^>\<E{>;  t-T)d<,E{ydT 


+  ^-R^\E,)r  JJ  S{Vj)  d{Vi  ^  1)  H{Eo  -  Ei)  (5) 


where  d{vi)  is  the  Kronecker  delta  and  H{E)  the  Heaviside  unit  function.  The 
last  term  merely  indicates  that  nothing  happens  between  0  and  t  and  we  have  the 
initial  particle  of  type  i  with  energy  Eq  . 

1  L.  Janossy,  Proc.  Phys.  Soc.  A  63,  241  (1950).  See  also  R.  Bellman  and  T.  E.  Harris, 
Proc.  Nat.  Acad.  Sci.  43,  601  (1948). 


CASCADE    PROCESSES  427 

^{<v^y\(Eiy;  t  —  r)  is  the  probability  that  we  have  the  aggregate  <i^^>  at  t 
given  that  we  started  with  the  aggregate  iE\y  at  t.  This  is  a  function  of  the  dura- 
tion t  —  T  and  not  of  t  and  t  severally  since  the  process  is  assumed  to  be  homo- 
geneous in  t.  The  probability  frequency  function  7r«a^^>  |<^|>;  ^  —  t)  can  be 
decomposed  in  terms  of  the  probability  frequency  function  with  simpler  initial 
conditions,  i.e.  the  one  particle  initial  condition  as  in  7r*«r^>|^Q;  t)  in  the 
following  manner  if  we  assume  that  the  particles  of  the  aggregate  <^|>  behave 
independently  as  primaries  of  cascade  processes, 

^«vm>  I  <m  \t-r)  =  n  ^^'\<vil>  \E\',t-T)  (6) 

I,') 

where  '  I!<Ky  =  <^m> 

I 

<rj^>  being  the  aggregate  of  ^'J^^  particles  with  energy  greater  than  E^,m=  1,2 ,. . 
produced  by  a  particle  of  type  j  and  energy  E\ . 

Equation  (6)  can  be  reduced  to  a  differential  equation  in  t  by  differentiating 
with  respect  to  t.  We  then  have  a  system  of  simultaneous  integro- differential 
equations  involving  summations  over  Vi.  The  sums  can  be  reduced  to  a  product 
of  suitably  defined  generating  functions  by  the  application  of  convolution 
theorems.  In  spite  of  this  the  equations  are  still  too  general  and  simplifying 
assumptions  have  to  be  made  to  obtain  physically  interesting  results. 

The  simplification  consists  in  (1)  limiting  the  number  of  types  of  particles, 
(2)  assuming  simplified  expressions  for  the  cross-sections  and  (3)  asking  not  for 
the  distribution  function  but  only  for  its  moments. 

If  we  are  interested  only  in  the  mean  numbers,  there  is  no  necessity  to  intro- 
duce the  7t  function  and  thereby  obtain  the  mean.  A  standard  method  of  studying 
the  variation  with  t  of  the  mean  number  of  particles  is  available  in  this  case. 
In  fact  when  the  cascade  theory  was  formulated  in  1937  by  Bhabha  and  Heitler^ 
and  Carlson  and  Oppenheimer^  they  were  interested  mainly  in  the  mean  number 
and  the  fluctuation  problem  was  raised  only  incidentally.  For  the  calculation  of 
moments  it  is  still  possible  to  use  the  standard  method  of  studying  the  variation 
of  the  statistical  distribution  of  particles  with  t  provided  such  a  distribution  is 
defined  by  what  are  known  as  product  densities  which  are  much  easier  to  handle 
than  the  ui  functions.  We  shall  first  define  the  product  density  functions  and  in- 
stead of  describing  the  general  method  of  application,  we  shall  deal  with  two 
important  processes,  i.e.  the  nucleon  and  electron-photon  cascades. 

Method  of  product  density  functions^ 

If  the  particles  are  distributed  in  the  j^- space  which  is  a  continuum,  a  function 
fn [E^ ,  E^,  .  .  .,  En)  called  the  product  density  of  degree  n  is  defined  such  that 
tn(E^,E^,  .  .  .,En)dE^diE^  .  .  .dE^  represents  the  probability  that   there   is  a 

1  H.  J.  Bhabha  and  W.  Heitler,  Proc.  Roy.  Soc.  A  159,  432  (1937). 

2  J.  F.  Carlson  and  J.  R.  Oppenheimer,  Phys.  Rev.  51,  220  (1937). 
^  Alladi  Ramakrishnan,  Proc.  Camb.  Phil.  Soc.  46,  595  (1950). 
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particle  in  d  ^j ,  a  particle  in  d  ^2  •  •  • » ^^^  *  particle  in  d  ^„  irrespective  of  the  number 
elsewhere.  /„  is  not  a  probability  density  but  fn{E^,  J^g*  •  •  •>  En)^E\ ...  d^^  is 
a  probability  magnitude.  Therefore  it  is  possible  to  write  equations  for  /,j  by 
studying  the  behaviour  of  individual  particles  at  ^j ,  ^g ,  .  .  . ,  J^^ . 

If  N(E)  is  the  stochastic  variable  representing  the  number  of  particles  with 
energy  above  E  then 


S{[N(E)Y]=  ZC'.jj---jt.(E„  E„  •••,  E.)dE,dE^.-.dE. 


(7) 


E  E 


where  S  denotes  the  expectation  value  and  CJ  arc  the  coefficients  defined  by  the 
identity  r 

^'■=  i;CJJf(ilf-  \)'-'{M-8+\).  (8) 

4-1 

M  being  a  positive  integer.  The  coefficients  are  obtained  by  setting  M  =  1,2,.... 
It  is  clear  that  for  the  calculation  of  the  n-ih  moment  of  N{E)  a  knowledge 
of  the  product  densities  of  degrees  less  than  or  equal  to  n  is  necessary.  In  cosmic 
ray  problems,  since  we  are  dealing  with  two  types  of  particles  electrons  and  photons, 
we  define  /„ {E^ ,  ^2 '  •  •  • » -^n 5  0  ^^^  9n {^1 ,  E2,  .  .  .,  E^',  t)  as  product  densities  of 
degree  n  of  electrons  and  photons  respectively,  and  fgn,m(^i>  -^2»  •  •  •»  ^n  +  m)  0 
as  the  mixed  product  density  of  degree  (w,  m)  of  electrons  and  photons,  i.e. 

is  the  probability  that  there  is  an  electron  in  dE^,  an  electron  in  d^2'  •  •  •»  ^^ 
electron  in  dE^,  a  photon  in  dE^  +  i,  a  photon  in  dEn  +  2,  •  •  •  ^^^  ^  photon  in 

Since  /„dj^j  d^2»  •  •  •  d^,i  is  a  probability  magnitude,  its  variation  with  t  can 
be  expressed  in  terms  of  the  /,  g  and  fg  functions.  Equations  involving  product 
densities  of  order  greater  than  two  are  not  of  immediate  interest  since  the  analytical 
solution  of  /2,  9^2  ^^^  f9i,i  i^  itself  a  long  and  tedious  process. 

Closely  allied  to  the  product  density  functions  are  the  density  functions  of 
Janossy^  defined  in  an  attempt  to  treat  the  fluctuation  problem  of  nucleon 
cascades.  He  defined  a  function  Jji(E^,  E^,  .  .  .,  Ej^)dE^  dE^  .  .  .dEj^  representing 
the  probability  that  there  is  one  particle  in  dE^,  one  in  dE^,  and  one  in  d^;^ 
and  none  in  the  other  energy  states.  Jj^  is  not  a  probability  density.  Janossy  deals 
with  JJ^|h !  but  it  is  Jj^  that  has  a  physical  significance  since  the  particles  are  in- 
distinguishable . 

The  Janossy  functions  Jj^  are  related  to  the  product  density  functions  ^  as 
follows :  ^ 

h(E,,  E,,  ...,  E,)^2J^^^,ll-ljniE,,  E,,  ....  E„). 

dEj^dE^'-'dEn.     (9) 

*  L.  Janossy,  Proc.  Roy.  Irish.  Acad.  A  53,  181  (1950). 

2  A  detailed  account  of  the  connection  between  f^  and  J^  is  given  by  Alladi  Ramakrishnan 
in  Proc.  Camb.  Phil.  Soc.  48,  451  (1952). 
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The  probability  7i{n,  E)  that  there  are  n  particles  above  an  energy  E  is  given  by 

C»  CO  oo 

n{n,  E)  =  ^jj-"Jjn{E^,  E^,  .-.,  En)  dE^dE^^  -  -  dE^.  (10) 

E  E  E 

Since  it  is  difficult  to  obtain  the  solution  of  J^  for  any  n  (in  fact  it  is  only  possible 
to  obtain  the  solution  for  small  values  oi  n,  by  iterative  methods)  we  have  to 
content  ourselves  with  the  first  few  moments  of  the  number  of  particles  above 
a,  certain  energy.  To  calculate  the  ^th  moment,  we  have  to  know  J^  for  all  values 
of  n  while  we  need  only  product  densities  of  degree  less  than  or  equal  to  'p.  Hence 
for  the  calculation  of  moments  which  is  the  only  feasible  task  we  need  only  the 
product  densities. 

We  shall  first  take  up  the  application  of  the  product  density  technique  to  the 
case  of  the  one  particle  cascade. 

Cascades  involving  only  one  type  of  particle^ 

We  shall  assume  that  the  cascade  consists  only  of  one  type  of  particles,  i.e. 
nucleons.  Given  the  initial  energy  spectrum  of  nucleons  at  ^  =  0  and  that  a  nucleon 
of  energy  E  drops  to  an  interval  between  E^  and  Ei  +  dE^  creating  a  recoil 
nucleon  of  energy  between  E^  and  Ej  +  dE^  with  a  probability  per  unit  distance 
given  by  co{Ei,Ej\E)dEidEj,  our  object  is  to  calculate  the  moments  of  the 
distribution  of  the  number  of  nucleons  above  a  certain  energy.  Since  we  cannot 
distinguish  between  the  recoil  nucleon  and  the  incident  nucleon,  we  can  define 

W{E„  E^  \E)  =  co{Ei,  Ej  \E)  +  co(E^,  E^  \E)  (11) 

where  W {Ei,  Ej  \E)  represents  the  probability  per  unit  distance  that  a  particle 
of  energy  E  is  replaced  by  a  pair  one  of  which  has  an  energy  E^  and  another  Ej . 
Let  us  write  the  diffusion  equation  for  /^ {E-^ ,E^,  . .  .,  En',t)  the  product  density 
of  degree  n,  fniE^,  E^,  .  .  .,  E^;  t)  dE^  dE^  ■  .  •  dE^  representing  the  prob- 
ability that  there  is  a  nucleon  in  the  interval  d^^,  a  nucleon  in  d^g'  •  •  •  ^^^  ^ 
nucleon  in  the  interval  dE„  irrespective  of  the  number  of  nucleons  elsewhere. 
We  have 


dt 


+JfJE[,E'^,-.-,E'„^^,E;t)W{Ei\E)dE 

+  jfn-i(K'>  K'  ■■■'  K'-„  E;  t)  W{Ei,  Ef\E)dE     (12) 


^  M.  S.  Bartlett,  An  Introduction  to  Stochastic  Processes,  Cambridge  University  Press 
(1955);  Alladi  Ramakrishnan.  Proc.  Camb.  Phil.  Soc.  48,  451  (1952). 
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where  W{Ei)  =  J  f  W{Ej,  Et\Ei)dEjd Ej, 

Ej  Ek 

W{Ei\E)  =  fW(Ei,  Ei\E)dEj  (13) 

^i 

and  Ei  for  ik  =-  1,  2,  .  .  . ,  w  —  1,  represents  the  values  Ei^  for  k  =  1,2,  .  .  .,n 
omitting  Ei  and  Ei'  indicates  omission  of  two  variables.  The  equation  is  derived 
by  just  considering  what  happens  to  each  of  the  nucleons  envisaged  in  the  func- 
tion /„.! 

Assuming     that     W(Ei,  Ej  \E)  dE^dEj    can    be    expressed    in    the    form 

W{€i,  Ej)  dsi  dej  where  Si  ^^  -^  and  e,-  =  -^ ,  the  formal  solution  of  these  equa- 
tions can  be  obtained  by  the  Mellin  transform  technique.  Setting 

1  1 
ffWiSi,  ei)d8ider--l  (14) 

0  0 

by  suitably  choosing  the  unit  of  t  and  defining 
1  1 
a{Si,  sj)  =  flW{ei,  £,)£f-if?-M£,d£,.  (15) 

0  0 

c„(«„  s„  ...,  s„;  t)=J...lf,AK  E^,  . ..,  £„;  <)  il'-i. .  .£«-'d£i. .  .d£„  (16) 
we  obtain  the  equation 

o 

-^7C„(5i,   52.    •  •  -   ^n'^   i)  =  -^CniSv  «2.    •  •  •>   «« i   0  +  2/ C,,  (Sp   s'^,    .  .  .,   ^.j,   Sf,  t) 
Ct  i 

X  {a{Si,  1)  +  a(l,  Si)}  +  2J2J  Cn-i{s[',  <',  •  •  .,  Ca^  ^i  +  ^i  "  1;  0«(«i.  «j)-        (17) 

i     j 

In  particular  we  solve  for  c^  and  Cg  in  the  case  of  a  cascade  generated  by  a  single 
particle  of  energy  Eq  : 

Ci(5i;  0  =  ^?-'e-^*^-^  (18) 

A,=  \-a\s^,  l)-a{\,  s^)  (19) 

c,{s„  s,;  t)  =  E^'-'  \a{s     s)  +  a{s,,s,U  ^  j^.^,^,,^_,  _  ^-u,^.^«.).,j         ^20) 

and  the  mean  and  mean  square  numbers  of  particles  above  the  energy  E  is  given 
by  oo 

<^{iV^(^;0}  =  //i(^i;  t)dE, 

E 

oo  oo 

Spf{E;  t)f}  =  S{N{E:  t)]  +  j  j  U{E^,  E^;  t)dE,dE,,.  (21) 

E  E 


1  A  detailed  derivation  of  this  is  given  in  the  section  on  electron-photon  cascades. 
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/i  and  /g  are  obtained  by  inverting  c^  and  Cg  as  follows : 

y  +  ioo 


y-  ioo 


(22) 


We  have  considered  only  a  process  in  which  the  fundamental  cross-section 

is  defined  for  the  production  of  two  particles.  Quite  generally  we  can  define  a 

cross- section^ 

Wn{E^,E^,  ...,E^\Eo)  dE^  dE„  ...,dE^dt 

that  n  particles  are  produced  in  the  ranges  dE-^,dE^,  .  .  .,dEn  and  none  else- 
where by  a  particle  of  energy  Eq  in  passing  through  d^.  The  total  cross-section 
is  given  by  ^ 

"^  =  1!  ^f-  '  JWJE,,  E„  ...,  E,\E,)  dE,...  dE,.  (23) 

El  En 

The  mean  number  of  particles  produced  in  dHn  the  energy  interval  dE^  is  given 

Q{E,\E,)dE,dt  =  Z'^Z^^y  j- .  -JW^E,,  E„  .  ..E^\E,)dE,...dE, 
and  ^2       ^« 

^(^o)  =Jq{E,\Eo)  dE^  =  i;^j. .  .jn  Wn{E„  E^,  .  .  .  E^\E^)dE^. .  .dEn.    (24) 

El  El         En 

From  the  above  equation  it  is  quite  obvious  that  we  can  have  processes  in  which 
the  fundamental  cross-sections  TT^'s  differ  and  are  represented  by  say  W'fi  and 
Wn  but  yield  the  same  q{E^  \Eq). 

For  example  when  W^^Oif  n  ^  1 

and  (25) 

W?  =  Q{E,\E,) 

the  function  representing  the  mean  density  is  also  the  cross-section  for  the  r  ep  lace 
ment  of  a  particle  of  energy  Eq  by  that  of  energy  E^.  The  equations  obeyed 
by  the  product  densities,  including  degree  one,  will  however  be  different  since 
the  total  cross-section  W  is  different  in  both  the  cases,  and  this  W  occurs  in  the 
equations  for  product  densities.  Thus  even  for  writing  the  equations  for  mean  number 
it  is  not  enough  to  know  the  g  for  the  process  but  it  is  also  necessary  to  know  the  detailed 
cross-sections  W^.^ 


^  Alladi  Ramakrishnan  and  K.  Venkatesan,  Proc.  Cosmic  Ray  Symposium,  D.A.E., 
Govt,  of  India  (1961). 

2  This  fact  may  prove  helpful  in  gleaning  the  multiplicative  nature  of  the  cross-section 
even  from  the  mean  number. 
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Electron-photon  cascades 

It  was  in  the  study  of  cosmic  ray  showers  consisting  of  electrons  and  photons 
that  the  cascade  theory  was  first  put  forward  by  Carlson  and  Oppenheimer  and 
independently  by  Bhabha  and  Heitler.  In  their  original  formulation,  Bhabha 
and  Heitler  assumed  only  the  two  quantum  electrodynamic  processes,  Brems- 
strahlung  and  pair  production.  In  the  later  treatments  of  Bhabha  and  Chakra- 
barthy  1,  ionization  loss  was  included.  It  is  customary  to  assume  that  ^dt,  where  ^ 
may  be  a  function  of  E,  is  the  deterministic  loss  in  the  energy  of  a  particle  of 
energy  E  in  passing  through  matter  of  thickness  d^  The  inclusion  of  a  deterministic 
loss  introduces,  as  we  shall  see,  a  new  mathematical  feature  in  the  theory. 

In  view  of  the  comparative  completeness  of  the  theory  of  soft  cascades  we  shall 
summarize  the  theory  following  closely  Bhabha  and  Chakrabarthy.  Though 
considerable  refinements  have  been  made  later,  their  formulation  describes 
all  the  general  featui^s  of  the  behaviour  of  the  soft  component  of  cosmic  rays. 
The  new  refinements  do  not  carry  us  any  further  from  the  point  of  view  of  numeri- 
cal calculations  and  understanding  of  the  physical  phenomena. 

Our  object  in  the  first  instance  is  to  calculate  the  mean  energy  spectrum  of 
electrons  and  photons  emerging  at  thickness  t  defined  by  the  functions  /^  {E ;  t) 
and  gi{E;  t)  such  that  fi(E;  t)  dE  and  gi(E;  t)  dE  represent  the  mean  number 
of  electrons  and  photons  in  the  range  d^  respectively.  We  assume  that  the 
probability  that  an  electron  of  energy  E  drops  to  an  energy  between  E^  and 
E*  -\-  dE'  emitting  a  photon  of  energy  E  —  E'  while  passing  through  matter  of 
thickness  dHs 

T37"(^ir^)  log(183Z-V.i?(i)(J^'|^)d^'  (26) 

ivhere 

E-E'      /4         \/,  E      W  dE' 


/.»(^'|^)d^'  =  jl^-(|+a)(l-^)) 


E 


(27) 


and  Ze  is  the  charge  of  the  nuclei  of  the  material  traversed.  It  is  convenient  to 
■choose  the  characteristic  unit  of  length  for  the  medium  traversed  by  the  electron 

■as  AY2    I      p2      \2 

«o  that  the  differential  cross-section  is  now  given  by  R^^^  .  The  only  parameter 
depending  on  the  material  traversed  is  a.  The  above  is  valid  in  the  case  of  complete 
screening  of  the  nucleus  by  the  surrounding  atomic  electrons.  It  is  to  be  noted 
that  this  cross-section  is  asymmetric  in  E  and  E  —  E'.  The  corresponding  pro- 
bability that  a  photon  gives  rise  to  an  electron  of  energy  E'  and  a  positron  of 
■energy  E  —  E'  with  the  s/»me  characteristic  unit  of  length  is  given  by 

i^-(^'i^)d^'=[i-(4+«)(-f-^)]-^|:.        (29, 


1  H.  J.  Bhabha  and  S.  K.  Chakrabarthy,  Proc.  Roy.  Soc.  A 181,  267  (1943). 
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If  f-^{E\  t)  is  the  density  (product  density  of  degree  one)  of  the  positive  and 
negative  electrons  of  energy  E  and  g^{E;  t)  of  the  photons  of  energy  E  at  t,  they 
satisfy  the  equations 

E 

^'^'  '*  =  -h(E;  t)  JBS^\E'\E)AE' 

0 

oo  oo 

+  fh{E';  t)R^^^E\E')dE'  +  2fg^{E';  t)R^^\E\E')diE' 

E  E 

o 

+  ^^{Mi{E\t)}.  (30) 


dE 


^{E;  t) 
dt 


g^iE'J)  fR^^^{E'\E)dE' 

0 

oo 

+  ffiiE';  t)  R^^^E'  ~  E\E')  dE'.  (31) 

E 

The  factor  2  in  (30)  arises  because 

i2(2)  ^E  \E')  =  i?(2)  (^'  —  E\E').  (32) 

The  first  term  on  the  right  hand  side  of  the  equation  (30)  gives  the  depletion 
of  the  electrons  from  the  range  d^  due  to  radiation;  the  second,  the  influx  into 
this  range  from  electrons  of  higher  energy;  the  third  term,  the  contribution 
from  the  photon  (by  pair  creation)  while  the  fourth  term  is  due  to  ionization 
loss.  This  term  is  obtained  as  follows :  particles  in  the  neighbourhood  of  the  range 

of  width  ZJ 1 1  +  -r-=-dn  at  energy  E  -\-  "^"^df  after  penetration  of  thickness  d^ 

will  have  an  energy  E  in  the  range  A .  Thus  the  difference  between  particles  in  the 
range  of  width  A  in  the  neighbourhood  of  ^  at  thicknesses  t  and  ^  +  d^s  given  by 


h{E  +  pdt;t)^l  +  -^dtJA-fAE;  «)^  =  (/S^  + /i^)^ -g' 


(33) 


Similarly  the  terms  in  the  second  equation  can  be  interpreted. 

Noting  that  R^^^  (E'  \E)  dE'  is  homogeneous  in  E  and  E',  that  is,  it  can  be  written 

E' 
as  i2^^^(£)  de  where  £  =  -^=^  we  can  reduce  the  above  equations  by  the  Mellin 

Ml 

transform.  Defining  the  Mellin  transforms  of  /^  and  g^  as 

oo 

V{s',  t)=  JE'-^h(E\  t)dE  (34) 

0 

oo 

q{s;t)  =  fE'-^g^{E;  t)dE  (35) 

0 

EPCR      28 
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we  find  that  p{s;  t)  and  q{s',  t)  satisfy  the  equations 

-^p{s\  t)==-A,p{8;t)^B,qis',  t)  -  P{s  -  I)  p{s  -  I;  t)  (36) 

^jq{s;t)  =  C,pi8;t)-Dq{s;t)  (37) 

where  } ,    ,  ,  ,      v 

^,  =  /{/?<i^(f)(l-£<'-i))d£ 

0 

£»  =  I  +  -^  a  (38d) 

y  being  the  Euler-Mascheroni  constant. 

The  essential  difficulty  in  solving  the  differential  equation  (36)  for  ^(5;  0 
is  due  to  the  occurrence  of  the  term  p{s  —  1;  t).  Since  5  is  a  complex  variable, 
it  is  not  possible  to  use  any  familiar  iteration  procedure.  Bhabha  and  Chakra- 
barthy  have  obtained  the  solution  for  this  equation,  first  by  obtaining  Po(s;  t) 
the  solution  without  ionization  loss  and  therefrom  the  general  solution  with 

Eliminating  q{s;  t)  from  the  above  equations  (36)  and  (37),  we  have  for 
^Pois;  t)  =  -{A,  +  D)  j^p,{s;  t)  =  {A,D-B,C,)po{s',  t) .         (39) 

This  has  the  solution  ^^^^   ^^  _  ^^  ^-x,t  ^  j^  ^-^,«  (40^ 

ttg ,  bg  being  independent  of  t  and  jUg ,  Xg  being  the  roots  of  the  equation 

Af  -  {Ag  +  D)Xg  +  C{Ag  -BgCg)  =  0,  (41) 

i.e. 

Xg  =  h{Ag  +  D)  +  i  {{Ag  -  Z))2  +  4  Bg  0,}V.  (42) 

fig=i{A  +  J))-i  {(^«  -D)^  +  4.  Bg  C,}'/.  (43) 

and  ttg  and  bg  are  determined  by  the  initial  conditions.  For  a  shower  initiated  by 
an  electron  of  energy  Eq  we  have 

U{E;0)  =  d{E-Eo);         gAE;0)  =  0  (44) 

p{s;0)  =  El-^;         q{8;0)  =  0  (45) 

^{p{s,t)}=-AgEl-\  (46) 
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Imposing  the  above  boundary  condition  we  obtain 

Pois,  t)  =  El-^ip,{s,  t)  (47) 

where 

^o(^,  t)  =  {^^  e-^^^  +  i^^  e-^4  .  (48) 

The  solution  of  (30)  which  includes  ionization  loss  can  be  shown  to  be 

a+'too 

e 

a  -ioo 

where  the  function  ^p,^  [s ;  t)  satisfies  the  recurrence  relation 

t 
Wnis;  t)  ^  j  ip^{s  ^- n;  t-f)y),,_^{s;  t')  df  (50) 

0 

which  can  also  be  written  in  the  form 

t  ,  . 

y^nis;  i)-  fwoi^;  t)yj,,_T^{s  +  l;  t-t')dt'.  (51) 

0 

ipn  and  y)n  _  ^  satisfy  the  equation 

-[-^  +  D)wn-i{s;t),  (52) 

The  series  within  the  curly  brackets  in  the  expression  for  /j  {E ;  t)  is  uniformly 
and  absolutely  convergent  ii  E  ^  fit.  It  also  follows  that  the  integral  on  the 
contour  c  is  convergent,  so  that  f^{E;t)  is  a  well  defined  function  of  E  and  t 
and  hence  the  summation  and  integration  can  be  interchanged. 
We  now  introduce  the  functions 

W^  {s  ;  t) 


and  write 

a  +  ioo 


The  solution  (55)  is  an  analytic  continuation  of  (49)  and  the  restriction  E  y  ^t 

can  be  dropped,  so  that  (55)  is  the  exact  solution  of  (30)  for  all  E  ^  0. 

28* 
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To  get  the  total  number  of  particles  with  energy  greater  than  E  we  integrate 
(55)  from  E  to  Eq  and  obtain 

(T+  too 
O-  lOO 

P(s  4-  n)  \ 

~7>r      '^^^''  ^)}d5.     (56) 
This  expression  can  be  written  as  ; 

e[N{E;l))^-2:NAE:t)  (57) 

wliere 


*^"-i  r(5  +  w) 


r(s) 


K(s;t)ds.  (68) 


Multiplying  equation  (30)  and  (31)  by  JS/  and  integrating  with  respect  to  E  from  0 
to  00  and  adding  these  two  equations,  we  get 

-^\fEf,{E',t)dE-\-fEg,{E;  i)dE\ 
^-^jh(E;t)AE 

0 

=  -PN{0;t).  (59) 

In  other  words,  the  change  in  the  total  energy  of  the  shower  is  entirely  due  to 
collision  loss.  Integrating  (57)  with  respect  to  t  from  0  to  oo ,  we  get  for  a  shower 
initiated  by  an  electron  of  energy  Eq  , 

t 

J  N{0,t)dt  =  -^.  (60) 

0 

We  shall  now  take  up  the  calculation  of  the  fluctuations  about  the  mean 
number.  For  this  purpose  we  have  to  obtain  the  diffusion  equations  for  the  pro- 
duct densities  of  degree  two  and  obtain  therefrom  the  mean  square  number  and 
the  deviation  from  the  mean  number  of  particles  above  a  certain  energy.^  Numeri- 
cal results,  as  complete  as  for  mean  number,  are  available  only  when  we  neglect 
the  ionization  loss.^  The  equations  including  ionization  loss  can  be  formally  written 
down  but  then  exphcit  analytical  solution  has  not  been  obtained  till  now  in  a  manner 
suitable  for  numerical  computation. 


1  H.  J.  Bhabha  and  A.  Ramakrishnan,  Proc.  Ind.  Acad.  Sci.  32,  141  (1950). 

2  A.  Ramakrishnan  and  P.  M.  Mathews,  Progr.  Theoret.  Phys.  11,  95  (1954). 
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Equations  for  /g  and  g^ ,  the  product  densities  of  degree  two  of  electrons  and 
photons  and  mixed  product  density  fg^  ^  can  be  written  as  follows : 

-^^(^1,  E^;  t)=--U{E„  E^;  t)\  lR^^){E\E,)dE  +  fB^^HE\E^)dE\. 

U  0  f 

OO  CX) 

+  f  U{E,  E,;  t)R^^^{E,\E)dE+  I  U{E„  E;  t)  R^^^E^\E)dE 

El  E^ 

OO  OO 

+  2//grii(^i,  E-  t)R^^HE,\E)dE  +  2lfg,^,{E^,  E;  t)  R^^\E^\E)  dE 

E,  E, 

+  2g, {E,  +  E^;t)  R^^^E^ \E^  +  E,)  (61) 

dfg^  {E„  E„  t)  ^  _/     ^(^^^  ^^.  ^^  j  rV)(^|^,)  d^  +  fE^'\E\E,)  d^l 

OO  OO 

+  ffg,,AE,  E,;  t)R^^^{E,\E)dE+  l2gAE,  E,;  t)  R^^^{E,\E)  dE 


OO 


+  IU{E,,E;t)R^^HE-E^\E)dE  +  f,{E^-i-E^',t)R^^\E,\E,  +  E^)  (62) 

df^{E„  E,;  ^)^_fg^^^^E,.  E;  oj />(^|^3)  d^  + /V>(^|^,)  d^  j 
^^  lo  0  ' 

OO  OO 

+  I  fgi,AE,  E,;  t)R^^HE^\E)dE+  J2gAE,  E,;  t)R^^\E^\E)  dE 

E,  E, 


OO 


E, 

El  E^ 


^'^"^^^''^-..(^.^.0 


+  f  U{E2,  E;  t)R^'\E-E,\E)dE  +  fAE,  +  E^;  t)  R^^HE2\A  +  E^)      (63) 

j  R^^^E\Ei)  dE  +  I  R^^^EIE^)  dE 
+  j  fg^,^{E,  E,',  t)R^^HE-E^\E)dE 

OO 

+  //9'i.i(^,  E^;  t)R^^\E-E^\E)dE.  (64) 


It  is  to  be  noted  that  fgi,i(E^,  E^\  t)  is  not  equal  to  /g'i,i(^2J  E^;  t).  The  initial 
conditions  are  defined  as 

f,{E;0)  =  S(E-E,);        g,{E;0)^0,  (65) 

i.e.  at  ^  =  0  there  is  only  one  primary  of  energy  Eq  and 

U{E^,E^;  0)  =  0;     g^iE^,  E^iO)  ^  0;     fg,,,(E„  E,',0)  =  0.  (66) 

The  various  terms  in  the  equations  are  written  down  by  the  now  familiar  argu- 
ments on  studying  the  individual  behaviour  of  the  particles  in  the  various  energy 


I 
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ranges  and  estimating  their  contributions  to  the  product  densities.  It  is  note- 
worthy that  the  equations  for  the  product  density  of  degree  two  contain  on  the 
right  hand  side  terms  involving  product  density  of  degree  one.  In  fact  these  are 
the  terms  that  define  the  correlation  between  the  particles  in  the  two  energy 
ranges. 

(i)  The  term  2g^{Ej  +  E^\  t)  R^-^E^  \  E^  -(-  E^)  in  (61)  represents  the  contri- 
bution of  product  density  of  degree  one  to  the  product  density  of  degree  two 
arising  due  to  the'  creation  of  a  pair  by  a  photon  of  energy  E-^^-[-  E^. 

(ii)  The  term /i(^i  +  E^\  t)  R(^\E^  \  E^  +  E^)  in  (62)  arises  due  to  the  radiation 
of  a  photon  of  energy  E^  by  an  electron  of  energy  E^  -|-  E^^  and  thus  contributing 
to  the  mixed  product  density  fgi,i{E^ ,  E^. 

(iii)  The  term  /^  [E^^  I  E^ ;  0  R^^^  (E^  \  E^  +  E^)  in  (63)  represents  the  production 
of  a  photon  of  energy  E^  radiated  by  an  electron  of  energy  E^  -\-  E^,  thus  contri- 
buting to  the  mixed  product  density  fg^^  {E^,  E^). 

(iv)  The  absence  of  a  correlation  term  in  equation  (64)  does  not  mean  that  the 
photons  are  uncorrelated  since  the  product  density  equations  are  coupled. 

If  the  correlation  terms  were  absent,  then  the  solutions  for  ^(J^i,  E2',  t)  could 
have  been  written  as      f^  (^^  ,E,J)  =  kf,  {E, ;  ()  f,(E^ ;  t)  (67) 

and  with  the  initial  conditions  the  equations  will  yield  the  trivial  solution  zero 
for  densities  of  all  orders. 

The  above  equations  have  been  completely  solved  using  the  Mellin  transform 
technique.  We  define  the  transforms, 

v{r,t)  =  j  W-^f^{E;  t)dE;         y{r,  t)  =  j  W'^g^iE;  t)dE  (68a) 

0  0 

oo  oo 

v^ir,  s,  t)^  I  dEf  dE'E'—'E'^-^UiE^E';  t)  (68b) 

0  0 

oo  oo 

ry(r,  s,  t)  =  f  dE  f  dE'  W-^E'^-^fg^^^iE,  E' ;  t)  (68c) 

0  0 

oo  oo 

y^ir,  s,  t)  =  j  dE  j  dE'  W-^E"-^g^{E,  E' ;  t) .  (68d) 

0  0 

The  diffusion  equations  can  be  written  in  the  matrix  form  as 

-^9)(2)(^^  5.  t)  =  [T{r)x  I  -^  I  X't{8)](p^''Hr,  s;  t)  +  ^p{r,  s;  i)  (69) 

where  /   Vzir,  s;  t)  \ 

(2)t  4\       I  ^y^^^  *'  ^)     I  nci\ 

I    vy{s,  r;  t)    J 

\  y^ir,  s',t)  I    ' 
/  is  a  2  X  2  unit  matrix,  ^  is  a  column  vector  defined  by 

y,{r,  s\  t)  =  xp^{r,  s)El'-'-^e-^r  +  s-^tj^xp^{r,  s)^J^''-2  e-"r  +  .-i«  (71) 
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where 


Wi(r,  s)  = 


W2{f>  «) 


y«r 


^r+s-1 


f^r+8-1       ^r+8-1 


a^ir,  5)(Z)-A^+,_i) 
0 

0 


and 


^(.,.).l^mu. 


(I+^If 


r  5 


r{r-s)     ["'\3    '  "J  {r  +  s+l){r  +  s) 
r{r)r{s)    f     s 


«2(^  «)  =  p/  ,  ';  i^ —  + 


(i-).^l 


According  to  the  initial  conditions  we  have 

v(s;0)  =  El-^',         y{s;0)  =  0. 

Hence  the  solution  for  the  inhomogeneous  equation  is  given  by 

t 
(p^^Hr,  s;  t)^C{r,  s;  t)  f  C'Hr,  s;  t')y){r,  s;  t')  dt' 

0 

where  ^  is  the  solution  of  the  homogeneous  equation 

-^{r,  s;  t)  =  [r(r)xI  +  Ixr{s)}C{r.,  s;  t) . 


(72) 

(73) 

(74) 

(75) 

(76) 

(77) 

(78) 


Corresponding  to  the  two  parts  y)^  and  ip^  of  t/i,  9)*^'  splits  into  9?'^'  and  9?^^* .  Thus 
we  find  that 


.(2) 


where  rj  {x)  is  the  diagonal  matrix  with  the  four  elements 


xt 


^^-\-  As  —  X 


K  +  1^8-^ 


[X.,  +  X^-x 


/^r  +  i^s  -  ^ 


(79) 
(80) 


(81) 


One  very  important  feature  of  this  matrix  is  that  it  remains  finite  even  if  any 
one  of  the  denominators  of  its  elements  tends  to  zero. 

From  the  theory  of  product  densities  the  mean  square  number  of  electrons 
with  energy  greater  than  E  is  given  by 

^[N\E;  t)}  =  ^{N{E;  t)]  +  f  j\{E^,  E,;  t)  dE.dE^  (82) 


E    E 

a  +  ioo  a  +  icx) 


f,{E„  E,;  t)  = -^^^^^    j    f    v,{r,s)El'E-,^drds. 


(83) 


a  -too  a-ioo 
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Numerical  calculations  have  been  made  using  the  following  approximation  as 
has  been  done  even  in  the  case  of  mean  numbers  for  all  but  small  thicknesses.^ 
Since  the  terms  involving  e'^^"*^^*^'  and  e" '*'+•-»'  exceed  those  involving  fi 
in  the  exponent  by  an  order  of  magnitude  it  is  found  that  terms  involving  fi 
can  be  neglected.  Thus  we  have 

or+  too   0+  lOO 

0{r,  a;  0 


a-  ioo  a-  too 


(r-l)(«-l) 


e-(^r+V«drd5  (84) 


where 


G (r,  s;t)=     '^       '^  J\       \ (85) 

X  (a,(r,  .)  fi,..0  +  ^^  ~  ^-^^1^-  ~  ^^  {i)(a:.  +  :..) 
-  ^^ /i,  +  (A^  -  a:)  (//,  -  x)]  a^ir,  s)  {D  -  ^r+s-i) 


a2(5,  r)(2)-.a,+,_i) 


(>a,  +  /^.-a;)(//,  +  /^,-a:)}-i  (37) 

with  X  =  2.r  +  s-l-      Xr  =  Xr  + fJLr  — X.  (88) 

The  numerical  results  are  tabulated  in  the  appendix. 

Higher  moments  of  the  number  distribution 

MesseP  and  his  collaborators  have  in  a  series  of  papers  made  a  systematic 
use  of  the  Janossy^  and  the  product  density  technique  for  the  calculation  of  the 
higher  moments  of  the  number  distribution  of  electrons  and  photons.  By  defining 


^  It  is  a  weU-known  mathematical  fact  that  under  suitable  circumstances  the  entire 
distribution  function  can  be  obtained  from  a  knowledge  of  all  the  moments.  The  question 
whether  it  is  possible  to  obtain  a  valid  approximation  from  the  first  few  moments  is  interesting. 
This  has  claimed  the  attention  of  Messel  and  Green. 

2  A  comprehensive  list  of  references  is  given  in  a  review  article  by  H.  Messel  in  Progress 
in  Cosmic  Ray  Physics  Amsterdam  (1954),  Vol.  2,  Chapter  IV. 

3  L.  Janossy,  Proc.  Roy.  Irish  Acad.  A  63,  181  (1950). 
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Mellin  transforms  for  higher  density  expressions  and  writing  the  resulting  equa- 
tions in  the  matrix  form  they^  have  obtained  formal  and  explicit  solutions 
of  the  equations  where  ionization  loss  is  neglected.  In  a  subsequent  contribution^ 
they  have  discussed  the  modifications  that  are  necessary  when  ionization  loss  is 
taken  into  account.  In  this  case  following  Bhabha  and  Chakrabarthy^  they  have 
assumed  that  the  higher  density  functions  can  be  expanded  as  an  infinite  series 
in  positive  power  of  ^  and  obtained  recurrence  relations  between  the  transforms 
of  various  densities. 

Alternative  approach  to  the  cascade  theory 

Till  now  we  were  interested  in  the  probability  distribution  of  the  number 
of  particles  at  a  given  depth  t  assuming  the  shower  to  be  initiated  by  a  particle 
at  ^  =  0.  However  in  nuclear  emulsions  it  would  be  more  convenient  to  estimate 
the  energies  of  particles  at  the  point  of  their  production  rather  than  the  energies 
of  all  particles  at  a  particular  thickness.  This  suggested  a  new  mode  of  approach 
to  the  cascade  theory. 

Instead  of  dealing  with  the  probabiHty  distributions  at  a  particular  thickness, 
we  define  the  function*  7t^{ni,n2,  .  .  .,n^,E^,  .  .  .,  E^\Eq,  t)  as  the  probability 
that  rij  particles  of  type  ;  each  with  energy  greater  than  Ej  at  the  point  of  its 
production  (called  primitive  energy)  are  produced  between  0  and  t.  Although  it 
may  not  be  possible  to  determine  n  exactly  the  moments  of  Uj  from  n  can  be 
calculated. 

The  new  product  density  functions  are  defined  as  follows : 

Fj,{E^,  E„  ...,E„'J„t^,..  .,  y  d^i  d^2  .  .  .  dEj,  dt,  dt^...  dh 

is  the  joint  probability  that  an  electron  of  energy  between  E^  and  Ei  +  dE-^^ 
is  produced  between  ^^  and  1^  +  dt^,  an  electron  of  energy  between  E2 
and  E2  +  dE^  is  produced  between  t^  and  t^^-  dt^  and  an  electron  of  energy 
between  Ej^.  and  Ejc-\-  dEj^.  is  produced  between  tj^.  and  tj^  +  dtj^ .  F j^.  is  the  product 
density  of  degree  k  for  electrons.  Similarly  G]^  product  density  of  degree  k  of 
photons  and  FGi^jg  the  mixed  product  density  of  I  electrons  and  k  photons 
can  be  defined. 

The  product  densities  F]^,  Gjg,  FGi^^-i  are  over  E  space  as  well  as  t  space. 
Using  the  properties  for  the  product  density  functions  it  is  found  that  if  the 
product  densities  are  integrated  over  E^,  E2  .  .  -  Ejc  over  the  range  Ec  to  00 
they  are  still  product  densities  of  particles  produced  between  t^  and  t^  +  d^^- 
{i  =  1,  2,  .  .  .,k),  the  primitive  energy  of  each  particle  being  greater  than  E^. 
The  moments  can  be  obtained  from  this  by  integration  over  the  proper  range 
of  t.  For  example  if  we  are  interested  in  the  r-th  moment  of  the  number  of  elec- 


1  H.  Messel  and  R.  B.  Potts,  Phys.  Rev.  86,  847  (1952). 

2  H.  Messel  and  R.  B.  Potts,  Phys.  Rev.  87,  759  (1952). 

3  H.  J.  Bhabha  and  S.  K.  Chakrabarthy,  Phys.  Rev.  74,  1352  (1948). 

^  A.  Ramakrishnan  and  S.  K.  Srinivasan,  Proc.  Ind.  Acad.  Sci.  44,  263  (1956). 
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trons  produced  between  0  and  ty  the  primitive  energy  of  each  of  the  electrons 
being  greater  than  Eg ,  it  is  given  by 

00  oo  oo 

^{N'^(E,,  t))=2JClfdE,fdE^..  .JdE,- 

*         Ec  Ec  Ec 

t  t  t 

fdt,fdt^ . .  .fdt.Fd^Ji  ...Eiih,..  ti)  (89) 

0  0  0 

CJ"  being  abeady  defined. 

Equations  for  the  product  densities  of  degree  1  alone  will  be  given  since  explicit 
solutions  for  degrees  greater  than  2  are  not  available  and  numerical  evaluation 
for  even  degree  2  is  very  difficult. 

Assuming  the  usual  Bethe-Heitler  cross-sections  and  neglecting  ionziation  loss 
the  equations  for  product  densities  of  degree  1  can  be  written  as 

F^^^{E\Eo;  0  =  2  fg['HE',  E^\  t)  R<^){E\E')dE'.  (90) 

E 

G[^^{E\Eo;  t)  =  jif{E'\E^,t)R^^){E'-E\E')dE'  (91) 

E 

yj'^  and  f^^  are  old  product  densities  of  degree  1  of  photons  and  electrons  respec- 
tively. Defining  the  Mellin  transforms  P<'>(5  \Eq,  t)  and  Qf(s  \Eq\  t)  as 

oc 

Pf{s\Eo;  t)  =  fF^^^{E\Eo;  t)E'^-^dE  (92) 

0 

00 

Q^^^{s\Eo;  t)  =  lGf{E\Eo\  t)E'-'^dE  (93) 

0 

we  obtain 

P['Hs\Eo;t)^y['^{s\Eo;t)B,  (94) 

Q^^HslE,;  t)  =  v['Hs\Eo',  t)C,  (95) 

y^*^  and  v^^  being  the  Mellin  transforms  of  g^^^  and  f^^  respectively. 

Similar  equations  for  product  densities  of  degree  2  can  also  be  written  down  but 
we  do  not  refer  to  them  since  numerical  results  are  not  available. 

The  asymptotic  behaviour  of  cascades  for  large  thicknesses  can  be  explained 
as  follows :  It  is  well-known  that  the  old  product  densities  and  all  the  moments 
of  the  number  of  particles  above  a  certain  energy  Ef,  {Ec  >  0)  at  thickness  t  tend 
to  zero  as  ^  ^-  00 .  This  is  obvious  from  a  physical  point  of  view.  Since  the  Bethe- 
Heitler  cross-sections  are  homogeneous  the  total  cross-section  is  independent 
of  the  energy  of  the  particle  and  as  such  the  probability  of  finding  a  particle 
above  a  certain  energy  Ec  at  t  tends  to  zero  as  ^  ->  oo .  As  the  new  product  densities 
are  expressed  as  linear  combinations  of  the  old  densities,  for  the  same  reason, 
the  moments  of  the  number  of  particles  produced  between  zero  and  t  with  this 
primitive  energy,  being  greater  than  Ec  tends  to  a  finite  limit  as  ^  ->  oo . 


CASCADE    PROCESSES  443 

The  mean  numbers  of  electrons  and  photons  produced  between  0  and  t,  the 
primitive  energy  of  each  of  them  being  greater  than  E^ ,  are  given  by 


(96) 


0  a-ioc 


0  a-ioo 

Inverting  the  order  of  integration  over  s  and  r,  we  have 


E.V-^ 


ds  (98) 


a-ioo 


Ag  fig 

a  +  'i  oo 


^      ^«  (1  _  e-^st)  +  ifl__^  (1  _  e-A^,<) 


K  /^8 


ds.  (99) 


From  this  it  can  be  seen  that  S[jV'(Ec  \E)]  and  S[j^{Ec  \Eq)\  tend  to  a  finite 
limit  as  ^  ->  00 .1 

Further  refinements  to  cascade  theory 

New  approach  by  the  regeneration  point  method 

The  essential  difference  between  the  old  and  the  new  approaches  to  cascade 
theory,  while  apparent  from  the  definition  of  the  two  types  of  the  product  density 
functions,  is  strikingly  brought  out  by  the  equations  representing  them  when 
we  employ  the  regeneration  point  technique.^  We  shall  first  discuss  the  simple 
case  of  cascade  consisting  of  only  one  type  of  particle  when  its  development  is 
defined  by  the  cross-section  R{E'  \E)  dE^  dt,  representing  the  probability  that 
a  particle  of  energy  E  drops  to  an  energy  interval  lying  between  E'  and  E'  +  dE', 
producing  another  particle  of  energy  E  —^  E'  in  traversing  a  matter  of  thick- 
ness d^. 

Let  7ii{n,  E  \EQ\t)  represent  the  probability  that  n  particles  are  at  t,  the  energy 
of  each  particle  being  greater  than  E  and  7iji(n,  E\EQ,t),  the  probability  that 
there  are  produced  n  particles  between  0  and  t  each  having  energy  greater  than  E 
at  the  point  of  the  production,  with  the  primary  energy  in  both  cases  being  Eq  . 


^  S.  K.  Srinivasan  and  N.  R.  Ranganathan,  Proc.  Ind.  Acad.  Sci.  45,  69  (1957). 
^  Alladi  Ramakrishnan  and  K.  Venkatesan,  loc.  eit. 
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This  diflference  in  reckoning  the  energies  of  the  particles  in  tij  and  Tin  gives  rise 
to  a  corresponding  difference  in  the  summation  in  the  equations  satisfied  by 
them. 

Consider  the  first  regeneration  point  t  where  the  primary  particle  of  energy  E^^ 
is  replaced  by  two  particles  of  energy  E*  and  Eq  —  E^  which  become  in  turn 
independent  primaries  of  subsequent  cascades.  For  7ri(n,  E  \Eq,  t)  these  two  pri- 
maries together  produce  7i  particles  with  energy  greater  than  E  in  a  distance 
t  —  T,  i.e.  there  exist  at  t,  n  particles  with  energy  greater  than  E.  However  for 
Tiiiin,  E  \Eq\  t)  the  counting  of  particles  is  different  as  it  depends  on  the  energy 
at  the  point  of  production.  Since  even  the  primary  particle  with  energy  Eq^  E 
at  /  =  0  forms  part  of  the  system  of  particles  with  primitive  energy  greater  than  E 
which  are  produced  between  0  and  t  it  is  clear  that  the  two  independent  primaries 
at  T  need  produce  together  only  n  —  1  particles,  of  course  including  the  two 
primaries.  We  can  now  write  down  the  equations  satisfied  by  Ttj  and  Tin 

t  £, 

7ri(w,  E\E^\  t)=  fdre-^^^')'  I  RiE'\Eo)dE' 

0  E 

X      2]    jtiim,  E\E\  t-T)7ii{m',  E\Eo-E\  t-T) 

in  +  m'  =  w 

+  e-R^^o)tS{n-\)7tj{7i,  E\E^\  t)  (100) 

t  E, 

.T„(«,  EEq',  t)  ^  f  dre-R^^'^'  f  B{E'\Eo)dE' 

0  E 

X       2/     ^ii(w.  ^!^o;  t-T)7iji{m\  E\Eo-E';  t-r) 

m  +  ni'  =  n-1 

+  e-R(^o)tS{n-l)7ijj{n,  E\E^;  t)  (101) 

where 

R{E,)  =  JR{E'\E^)dE'. 

0 

Defining  the  generating  function  G{u,E  \EQ\t)  Si^ 

Gj^^^{u,  E\Eo;  t)  =^  w^7ri.n(^,  E\E^\  t)  (102) 

n  =  0 

the  integral  equations  yield  the  following  differential  equations  with  respect  to  <, 

E. 

-j^  {u,  E\Eo;  t)  =  j  [G^(u,  E\E';  t)  G^  [u,  E\  E^;  -E';t-Gj  {u,  E\E^',  t)] 

E 

xR{E'\Eo)dE' 

E, 

— ^^  (u,  E\Eo;  t)  =J[u  Gu  [u.  E\  E' ;  t)  Gu  {u,  E\E^  -  E' ;  t)  -  (?„  {u,  E\  E^',t)} 

E 

X  R{E'\Eo)dE'.  (103) 
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Correlation  with  respect  to  thickness  of  matter  traversed^ 

Product  densities  hitherto  defined  refer  to  the  distribution  of  particles  either 
in  the  energy  space  in  the  old  approach  or  in  the  product  space  of  energy  and 
thickness  in  the  new  approach.  It  is  interesting  to  define  the  product  density 
J^^  {Ei;tj^,E^;t^,...,  E^^ ;  ^n  |^o '  ^o)  corresponding  to  different  thicknesses  such  that 
J^^d^i  d^  . . .  d^^  represents  the  probability  that  there  exists  a  particle  in  d^^ 
at  t^  one  in  d^^  at  t^  and  one  in  d^,^  at  t^.  This  is  also  the  mean  of  the  product 
of  the  numbers  in  the  range  djB7j  at  t^,  &E^  at  t^  and  dE^  at  t^.  If  we 
set  ^^  =  ^2  =  •  •  •  =  ^^  it  does  not  just  reduce  to  the  product  density  of  degree  n 
corresponding  to  a  particular  thickness.  The  T^-th  order  densities  corresponding  to 
different  thicknesses  is  connected  to  all  the  densities  of  order  m  less  than  or  equal 
to  71  corresponding  to  the  same  thickness.  We  here  establish  the  connection  for 
the  second  order.  For  ^a  >  ^i  >  ^c 

J^2(JS7i;  ^1,  ^2;  k\Eo\  to)  =  ff2{E„  E'^\E^',  t,- t,)  f^{E^\E'^;  t^-t,)dE'^ 

+  h{E^\E,',  t,  -  t,)ME,\E,',  t,  -  t,) .  (104) 

This  equation  is  obtained  by  the  following  argument.  The  particle  of  energy  E^ 

at  ^2  must  belong  to  a  cascade  generated  by  a  particle  of  energy  E2  {E^  +  ^1) 

or  by  the  particle  with  energy  E^  at  ty^.  The  first  and  second  terms  correspond 

to  these  possibilities  respectively.  For  the  case  when  /  is  homogeneous  in  the 

energy  variables,  the  Mellin  transform  of  .^^  (^^,  ^^ ,  ^2 ;  ^2  |^o5  ^0)  i^  obtained 

in  terms  of  the  Mellin  transforms  of  /^(E^^E^  |-^o»  ^1  —  ^0)  ^^^  /i(^;  0  where 

E2 
£  =  -=- .  It  is  to  be  noted  that  ^2  ^^  ^^*  S5mametric  in  t^  and  t^  as  should  be  ex- 

pected  from  the  evolutionary  nature  of  the  Markovian  process.  If  we  assume 
that  the  entire  cascade  is  generated  by  a  single  particle  of  energy  Eq  the  second 
order  density  at  t^  =  Iq  is  zero  and  hence  the  first  term  vanishes.  The  expression 
for  ^  then  reduces  to 

^{E,;t„  E^;  t^  \E,;  g  =  d{E,-  E^)  h{E^  \E^- 1^  ^  g.  (105) 

Defining  iV (^c ;  t^  and  N(Ec',  t^  as  the  random  variables  representing  the  number 
of  particles  above  energy  Ec  at  t^  and  at  t^  the  expectation  value 
S  [N  (E, ;  <i)  N  (E, ;  ^2)}  is  given  by 

00       00 

<^{N{E,;  t,)N{E,;  t^)}  =  f  f  ^^{E^',  t^,  E^;  t^lE^;  to)dE,dE^.       (106) 


Ec  Ec 


When  t^-^t^^t 


/i  (^2  1^2 ;  ^2  —  ^1)     tends  to     d  (^2  —  ^2) 
and  hence 

/ U {El ,  E'2\Eo,t-  to)  d {Eo  -  E^)  d E'^     becomes     ^ [E^ ,  E^^E^;  t  -  t^) 


1  Alladi  Ramakrishnan  and  T.  K.  Radha,  Proc.  Camh.  Phil.  Soc.  57,  843  (1961). 
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and  therefore 

=  U(E,,  E^;  t\Eo;  g  +  A(^i;  t\Eo;  t^)d{E^-E,),  (107) 

i.e.  the  density  ^^{Ei\  t^,E^;  t^  \Eq;  Q  in  the  limit  t^  ->  <i  does  not  just  represent 
the  second  order  density  but  also  includes  the  first  order  term  with  the  ^-function 
as  its  coefficient.  As  t^-^t^-^t 

<^{N{E,'J,)N(E,',t^)}^ 

00     oo  oo    oo 

/  JUiE,,  E^;  t\Eo;  gd^id^2+/  //i(^i;  t\Eo;  t^)  d{E,- E^)dE,dE^  (108) 

^"'r    J^c  Ef    Ec 

and  tends  to  ^{N'^(Ec;  t)}  as  it  should. 

In  the  case  of  cascades  with  many  types  of  particles  such  as  the  electron-photon 
cascade  the  second  order  density  ^  will  be  connected  not  only  to  the  product 
density  /.2  of  electrons  but  also  to  the  mixed  product  density  of  photons  and 
electrons  fg^  ^  of  all  orders  such  that 

^i'\E,;  t„  E^;  t^^E^-  g 

oo 

+  /i" (E, ;  <,i £„ ;  (o)  A" {E,\E, ;  t,  -  I,) 

+  ff9['\{E,;  E',;  t,\Eo;  t^)  f^HE^\E'^;  k-t,)dE'^.  (109) 

E, 

where  superscript  i  ^  1  represents  an  electron-generated  shower  and  ^  =  2  a 
photon-generated  shower. 

The  equation  for  the  higher  density  ^^^  can  be  formally  written  down  and  in 
this  case  it  involves  ^^  where  m  <  n,  and  /gr^^  such  that  I  +  m  ^  n. 

Polarization  phenomena  in  cascades 

So  far  in  our  treatment  of  the  soft  cascades,  the  parameters  used  to  describe 
the  longitudinal  development  were  mainly  the  thickness  of  matter  traversed 
and  the  energies  of  the  different  types  of  particles  comprising  the  cascade.  Since 
now  it  is  possible  to  study  cascades  initiated  by  polarized  electrons  available 
from  ^-meson  decay,  the  question  arises  whether  it  is  possible  to  include  the 
state  of  polarization  of  particles  in  the  description  of  cascade. 

The  inclusion  of  polarization  brings  in  its  wake  a  new  feature  not  present  iii 
our  previous  treatments.  An  electron  can  exist  in  a  '  'forward' '  {F)  or ' '  backward"  {B)  I 
polarization  pure  state  with  respect  to  its  direction  of  motion  or  in  any  other 
state  of  polarization  which  can  be  expressed  as  a  linear  combination  of  F  and  B 
states.  This  feature  has  to  be  included  in  the  product  density  functions  which  is 
best   achieved    by   adopting   the   density   matrix   formalism.    Thus   we   define 
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7iij{E  \Eq;  t)  and  yij{E  \Eq;  t)  to  be  the  product  density  function  of  degree  one 
for  electrons  and  photons  where  the  labels  i  and  j  are  F  and  B.  It  is  to  be  noted 
that  these  functions  are  product  densities  only  in  E  space  and  the  various  values 
of  the  suffixes  i  and  j  jdeld  the  four  types  of  product  densities  tz^f^  ^fb^  ^bf 
and  ji^s  for  the  electrons  and  similar  functions  for  the  photons. 

To  write  down  the  equations  for  Ji^/s  we  need  the  matrix  elements  of  the 
processes  rather  than  the  total  cross-sections.  For  in  cascades  with  the  inclusion 
of  the  state  of  polarization  of  the  particles,  the  progeny  of  a  particle  depends 
very  much  on  the  state  of  polarization  of  the  "parent"  which  will  be  determined 
by  the  relative  phases  of  the  production  matrix  elements  for  the  parents.  This 
feature  is  essentially  due  to  quantum  mechanics  and  it  complicates  the  cascade 
equations.  However  a  great  simplification  is  possible  if  we  neglect  the  angular 
distribution  of  particles  by  integrating  over  the  angles.^ 

Dyson  2  has  studied  in  detail  the  variation  with  depth  of  the  mean  of  number 
of  longitudinally  polarized  particles  which  is  defined  by 

tzjAE  \Eo;  t)  =.  7Zff{E  I^o;  t)-^BB{E  \Eo;  t). 

The  function  gives  the  mean  excess  of  the  forward  polarized  particles  over  the 
backward  ones.  The  usual  cascade  function  will  be  the  sum  of  the  jipp  and  jt^^. 
Equations  dealing  with  polarized  cascades  according  to  the  new  approach  can 
be  studied.  The  equations  for  the  different  product  densities  of  degree  two  to 
deal  with  the  fluctuation  problem  can  also  be  formally  solved.^  But  at  the  present 
stage  of  measuring  polarization  of  particles  in  the  cascade  showers,  these  dis- 
cussions are  not  of  direct  experimental  interest. 

Discussion  of  numerical  results 

Before  discussing  the  results  of  the  cascade  theory  with  and  without  ionization 
loss,  we  note  the  following  important  features  of  the  cross -sections  for  i*adiation, 
pair  creation  and  ionization  loss. 

(i)  If  we  choose  the  unit  of  length  according  to  (28)  the  cross-sections  for 
radiation  and  pair  creation  are  very  nearly  independent  of  the  material.  Their 
only  dependence  on  the  material  is  through  the  term  4/3  +  a  where  oc  is  very 
small  compared  to  4/3. 

(ii)  The  mean  energy  loss  per  unit  characteristic  length  at  very  high  energies 
is  equal  to  the  energy  itself,  i.e. 

^^  E  (110) 


dt 
or 


radiation 

E^E^er^  (111) 


1  F.  J.  Dyson  and  K.  W.  Mcvoy,  Phys.  Rev.  106,  1360  (1957). 

2  F.  J.  Dyson  (unpublished.) 

3  N.  R.  Ranganathan  and  R.  Vasudevan,  Proc.  Phys.  Soc.  (Lond.)  76,  659  (1960).  See 
also  Proceedings  of  the  Summer  School  of  Theoretical  Physics,  Part  II,  261  (1959),  Ministry  of 
Scientific  and  Cultural  Affairs,  Government  of  India,  New  Delhi. 
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where  Eq  is  the  initial  energy  and  t  is  measured  in  cascade  units.  Thus  we  can 
also  interpret  the  cascade  unit  as  the  distance  in  which  the  energy  of  a  particle 

becomes  —  times  its  initial  value, 
e 

(iii)  The  ionization  loss  /5  is  taken  to  be  independent  of  the  energy  and  is  pro- 
portional to  -^ .  The  critical  energy  is  defined  as  the  energy  at  which  radiation 

and  ionization  losses  are  equal.  Hence  it  follows  from  (100)  that  the  critical  energy 
E(,  is  equal  to  p.  A  table  of  the  values  of  /?  and  I  for  various  materials  is  given 
below. 


Material 

P  in  MeV 

I  in  gem 

Air 

84.2 

37.7 

Water 

83.8 

37.1 

Carbon 

102 

44.6 

Nitrogen 

89 

39.4 

Oxygen 

78 

35.3 

Lead 

7.8 

5.83 

(iv)  The  cross-section  for  Bremsstrahlung  becomes  infinite  as  the  photon  energy 
tends  to  zero  (infra-red  catastrophe)  i.e.  as  £  ->  1  in  R^^\  Therefore  the  total 
number  of  photons  of  energy  greater  than  or  equal  to  zero  tends  to  infinity. 
Since  we  have  assumed  the  cross-section  for  pair  creation  to  be  dependent  only 
on  the  ratio  of  particle  to  photon  energy  the  total  number  of  electrons  with 
energy  equal  to  and  greater  than  zero  also  tends  to  infinity.  However  it  is  meaning- 
less to  talk  of  electrons  with  energy  equal  to  zero.  We  should  consider  only  elec- 
trons and  photons  of  energy  greater  than  a  particular  value  E{ ,  the  numbers  of 


'max 


!FiG.  62.  Total  number  of  electrons  as  a  fimction  of  the  thickness  of  air  in  radiation 
units  (37.7  g  •  cm"^)  for  shower  initiated  by  single  photons  of  various  energies. 
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which  are  always  finite  so  long  as  ^^  >  0.  Mathematically  this  is  equivalent 

1  1 

to  saying  that  although  the  two  terms  j  R^^\e)  de   and    f  R^^^e)  s^~^  de  are 

0  0 

divergent,  their  difference  is  always  finite. . 

The  number  of  particles  N{E;  t)  above  E  varies  with  the  thickness  t  due  to 
particles  entering  and  leaving  the  energy  interval  {E,  E  +  d^).  At  the  beginn- 
ing of  the  cascade,  the  contribution  is  always  greater  than  the  depletion.  In  the 
€ase  of  cascades  without  ionization  loss  the  total  number  of  particles  above 
energy  ^  at  a  thickness  t  is  given  by 

^«  +  ^-^^e-^«}d5.    (112) 

[X  —  A  ] 


(113) 


N(E;  t) 

1 

~  2"7rT 

a 

a  +  loo 

r  (EoV^''   1 
J   \eI     s- 

-too 

iD- 

1  u- 

-X 

-J" 

The 

maximum 

of  N  occurs  when 

dN  {E, 

«)   . 

dt 


which  gives  the  corresponding  value  ^^j^x  to  be 


Wx(iS?)  =  1.01  loge^  -  1.57.  (114) 

The  value  of  N  at  this  point  is 

N^,AE)  ^  0.137  ^  {log^  -  l.Slj"  \  (115) 

Fluctuations:  The  development  of  a  cascade  is  a  multiplicative  process  from 
the  point  of  view  of  stochastic  theory  and  it  has  become  customary  to  compare  it 
with  two  stochastic  models,  the  Furry  and  the  Poisson.  The  first  represents  the 
simplest  multiplicative  and  the  second  the  simplest  non-multiplicative  process. 
We  shall  first  discuss  the  principal  features  of  these  models  before  comparing 
the  cascade  process  with  them. 

We  define  a  Furry  process^  as  follows:  Considering  only  one  type  of  particle, 
if  we  postulate  that  AdHs  the  probability  that  a  particle  produces  another  in 
thickness  d^,  then  the  probability  that  a  new  particle  is  produced  in  thickness  d^ 
i^nXdt  if  there  are  n  particles.  The  probability  7r(7i,  t)  that  the  stochastic  variable 
n  {t)  representing  the  number  of  particles  at  thickness  t  \^  n  satisfies  the  equation 

4^^^' ^^-A[(7J- 1)71(^-1;  t)-n7i{7i;  t)].  (116) 

dt 


^  W.  H.  Furry,  Phys.  Rev.  52,  569  (1937).  See  also  A.  Ramakrishnan's  article  in  Hand- 
buch  der  Physik,  loc.  cit. 
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Assuming  that  at  <  =  0,  there  is  only  one  particle,  i.e.  7r(l,  0)  =  1,  7i{7i,  0)  =  0 
if  »  4=  1 .  we  obtain  the  Furry  distribution 


ji{n',  t)  = 


^-^r"        ''''' 


^{n{t)} 
where 

S'{n{t)]  =  e^^ .  (118) 

The  mean  number  <f  {« (O)  increases  exponentially  and  therefore  has  no  maximum. 
A  very  interesting  feature  of  this  distribution  is  that  as  regards  the  discrete 
variable  7t,7i  has  a  maximum  for  w  =  0  and  falls  of  monotonically  with  increasing w. 
If  we  define  a  stochastic  variable  m{t)  =  n{t)  —  I,  i.e.  m(t)  represents  the  secon- 
daries, then 

S{m{t)}  =  e^^-l  (119) 

and 

o^{m{t))  =  <f  {m2(0}  -  [^(m{t)}f  =  [^{m{t)]f.  (120) 

The  mean  square  deviation  of  n  {t)  is 

o%  =  ^{?iHt))  -  [S'{n{t)]f  =  [^{n{t)}f  -  (^{n{t)].  (121) 

If  e^^  >  1,  the  distinction  between  m{t)  and  n{t)  is  not  important  and  we  have 


o%  ,  1 


r    .v.o=l r-TTT'^l     if     e^^»l.  (122) 

e{n{t)}^  e{n{t)} 

In  a  Poisson  process,  AdHs  defined  as  the  probability  that  a  new  particle  is 
produced  in  dt  and  is  independent  of  the  number  of  the  existing  particles  (non- 
multiphcative).  Then  7i(n,t)  satisfies  the  equation 

^'"■''^  =  [ji(»-l;  <)-;!(»;  <)]A  (123) 

with  3i(0;  0)  =  1,  ji(re;  0)  =  0,  »  =|=  0.  This  yields  the  Poisson  distribution 

„(n;t)  =  e-"-^  (124) 

ni 

and 

(f{n{t)}=^Zt  (125) 

ol  =  ^{n{t)}  (126) 

i.e.  if  we  write  aj  =  kG%,  A:  ~  0.  It  does  not  however  imply  as  we  shall  presently 
see,  that  if  A:  ^  0,  the  process  resembles  the  Poisson  process.  In  comparing  any 
stochastic  process  with  these  two  simple  models  we  obtain  the  mean  ^{w}  and 
the  mean  square  number  ^{/i^J  ^nd  express  the  mean  square  deviation  a^  as 

G^-^kl  =  h'ol.  (128) 
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Such  a  comparison  is  meaningful  only  if  <^[n]  :p  1  (say  at  least  10).  If  k^l 
we  say  that  the  process  resembles  the  Furry  type.  But  if  ifc  :^  0  we  cannot  conclude 
that  the  process  is  of  the  Poisson  type,  since  we  may  still  have  k'  :$>  I.  This 
happens  when  ^{w(^)}  >  1  such  that 

2  2 

FFwJP^^    '"'*    7R0)=^'>>1    *^''"§''    «='«^{»W}.    (129) 

This  fact  does  not  seem  to  have  been  realised  in  making  comparisons  with  the  Furry 
and  Poisson  processes  in  many  treatments  of  the  cascade  theory.  In  comparing  the 
cascade  process  with  a  Furry  model,  it  has  already  been  observed  that  the  mean 
number  above  a  certain  energy  reaches  a  maximum  in  the  cascade  process  while 
in  a  Furry  type  there  is  no  maximum.  Therefore  it  is  considered  worthwhile 
comparing  the  distribution  of  this  number  with  the  Furry  distribution  as  t  varies. 
It  is  obvious  that  at  thicknesses  small  compared  to  ^j^ax  the  distribution  should  be 
of  the  Furry  type  provided  the  mean  number  is  greater  than  about  ten. 

If  we  are  considering  the  total  number  of  particles  (above  a  certain  energy) 
the  distribution  will  be  exactly  of  the  Furry  type  if  there  were  only  one  type  of 
particle  and  the  cross-section  is  independent  of  the  energy.  In  the  case  of  electron- 
photon  cascades  we  have  two  types  of  particles  and  the  computation  of  the  total 
number  is  rendered  difficult  in  view  of  the  infra-red  catastrophe.  However  o^si^m- 
in^  the  total  cross-section  to  be  finite  it  can  be  shown  by  a  simple  calculation 
that  the  electrons  and  the  photons  obey  independently  an  approximately  Furry 
distribution  and  that  they  are  correlated  in  a  manner  such  that  the  electrons 
and  photons  together  have  also  a  Furry  distribution. 

In  the  actual  case  of  the  electron-photon  cascade  we  compare  the  number  of 
particles  above  a  certain  energy  with  the  Furry  type.  The  numerical  calculations 
on  the  fiuctuations,  i.e.  the  determination  of  k  and  k'  have  been  done  by  Rama- 
krishnan  and  Mathews. ^  An  examination  of  the  tables  leads  to  the  following 
conclusions : 

(i)  For  a  given  y,  the  thicknesses  at  which 

S[n},S{n%o^^     and      ^^^^^^l  ^^^^  (130) 

are  maximum  are  all  different  as  expected. 

(ii)  The   negative   values   of   ^^r  .^ — IfTT  correspond   to  negative  values 

[[^' W]^  —  S[n]\,  i.e.  when  S[n]  <  1. 

(iii)  For  very  large  values  of  y  but  before  the  shower  maximum  the  deviation 
is  a  finite  fraction  (about  0.1  to  0.2  times)  of  the  Furry  deviation. 

(iv)  Comparing  with  the  Poisson  distribution  it  is  found  that   ^,   .    increases 

6  {n^ 

with  y  for  a  given  value  of  t.  At  the  shower  maximum  the  deviation  is  many 


1  A.  Ramakrishnan  and  P.  M.  Mathews,  loc.  cit.  (1954). 
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times  the  Poisson  value  (a  result  mistakenly  supposed  to  be  at  variance  with 
that  of  Scott  and  Uhlenbeck^  and  Janossy  and  Messel*).  For  example,  for  y  =  8 
the  maximum  occurs  between  t  ^  6  and  ^  =  8  and  the  deviation  is  about  twelve 
times  the  Poisson  value  or  0.1  times  the  Furry  deviation. 

In  a  theory  including  ionization  loss  numerical  results  are  available  only  for  the 
mean  numbers  and  it  is  naturally  interesting  to  compare  the  energy  spectrum 
when  collision  loss  is  included  with  that  when  it  is  neglected.  The  exact  solution  is 
given  by  the  equation  (55)  and  it  is  clear  that  as  P  tends  to  zero,  the  solution 
tends  to  (49).  In  the  infinite  series,  two  important  points  have  to  be  noted: 

(1)  By  the  definition  of  y)„  in  (53,  54)  y)i{s,  t)  =  0  and  it  suffices  to  take  only 
two  terms  of  the  series  ^q  and  y)2  • 

(2)  (f{s;t)  has  the  following  asymptotic  forms.  For  large  values  of  8, 


(Tis;  t)^ 


when     0L't>2 


a'  logs 
^  t  when     a'  <  <  2 


(131) 


where 


"■=(!-)■ 


The  actual  values  of  q){s;t)  have  been  tabulated. 
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Fig.  63.  (p{s,  t)  as  functions  of  s. 


It  is  therefore  found  convenient  to  discuss  the  results  for  small  and  large 
thicknesses  separately. 


1  W.  T.  Scott  and  G.  E.  Uhlenbeck,  Phys.  Rev.  62,  497  (1942). 

2  0.  Janossy  and  H.  Messel,  Proc.  Phys.  Soc.  A  63,  1101  (1950). 
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(1)  Small  thicknesses:  Correct  to  the  order  of  t^,  the  solution  reduces  to 

a  +  ioo 
a-  ioo 


l-Ast+i{AI  +  B,C,)f' 


Eo 


f" 


ds.  (132) 


If  we  neglect  the  term  with  the  coefficient  ^  in  the  square  bracket,  the  spectrum 
is  identical  with  that  without  ionization  loss  except  that  the  energy  parameter 
is  shifted  through  pt,  i.e.  the  density  at  E  when  ionization  loss  is  neglected  is 
equal  to  that  at  E  —  ^t  when  ionization  loss  is  included  which  implies  that  all 
the  electrons  lose  energy  deterministically  by  ^t.  This  is  obviously  incorrect  since 
some  of  the  electrons  are  created  at  some  intermediate  point  and  hence  lose  energy 
less  than  pt.  The  inclusion  of  the  term  with  coefficient  ^  corrects  this  error.  At 
small  thicknesses  for  initial  energy  ^q  >  j5  there  are  not  enough  particles  near 
the  critical  energy  p.  The  spectrum  in  the  neighbourhood  of  ^  is  important  only 
for  large  thicknesses. 

(2)  Large  thicknesses;  Numerical  results  have  been  obtained  by  Bhabha  and 
Chakrabarthy^  using  the  method  of  saddle  points  corresponding  to  a  given 
energy  E  and  thickness  t.  Each  term  in  the  infinite  series  for  f^iE,  t)  has  its  own 
saddle  point  but  we  need  only  the  saddle  points  Sq  and  s^  corresponding  to  tp^ 
and  ^2-  These  vary  very  little  with  E  and  therefore  we  can  take  the  values 
corresponding  to  Eq  .  For  energies  corresponding  to  p,  N^  [E,  t)  takes  the  form 

NJE;t)^\//.^'-;,r'-'N„W.  (133) 

\E  +  pcpis^;  t)  j 

Keeping  only  the  first  two  terms,  this  becomes 

A(^;*)^constxj(.„-l)(-^^.fiyr„+K+l)(~^f^V|    (134) 

Whenever  ^2  ^  ^0  ^^®  spectrum  is  determined  almost  entirely  by  the  term  ipQ. 
— -^ —  I  ,  i.e.  a  modified  power  law.  The  values  of  99  are  tabu- 
lated and  are  approximately  equal  to  1.  The  value  of  Sq  can  be  obtained  from 
Fig.  60  for  any  t  and  y^ .  At  the  shower  maximum  Sqc^2  and  before  the  maximum 
Sq  varies  from  1.3  to  2  with  increasing  t.  This  implies  that  the  power  in  the  modified 
power  law  increases  slowly  with  thickness  corresponding  to  an  increase  in  the 
number  of  electrons  at  the  low  energy  end.  The  spectrum  becomes  nearly  an 
inverse  square  law  (modified)  at  the  shower  maximum  and  the  role  of  the  terms  iV2  > 
iVg  etc.  is  to  accentuate  this  tendency  since  Sq  and  ^2  ^re  not  very  different  when 
7^2  ^nd  Nq  are  comparable.  From  equation  (124)  it  can  be  seen  that  due  to  the 
factors  (Sq —  1)  and  {Sq  +  1)  the  contribution  of  the  second  term  will  be  greater 


H.  J.  Bhabha  and  S.  K.  Chakrabarthy,  Phys.  Rev.  74,  1352  (1948). 
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than  the  first  even  if  N2  is  slightly  less  than  Nq  for  energies  lower  than  those 
determined  by 


»o-l    No 


(135) 


It  can  be  actually  shown  by  numerical  calculations  that  the  contribution  due  to  N^ 
for  the  above  range  of  energies  is  greater  than  that  of  Nq  . 

The  values  of  Nq  and  iNTj  for  various  y  and  t  are  given  in  the  appendix. 
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Fig.  64.  Nq  and  iVg  as  functions  of  t;  numbers  on  the  curves  indicate  the  values  of  y^. 


Intranuclear  cascades 

In  the  cascade  processes  treated  till  now,  the  quantum  mechanical  cross- 
section  per  unit  thickness  of  matter  for  fundamental  processes  were  assumed 
and  the  development  of  the  processes  with  thicknesses  was  studied  as  a  stochastic 
process.  For  the  electrodynamic  processes  the  cross-sections  are  known  while 
for  nuclear  interactions,  especially  processes  involving  multiple  production,  the 
cross-sections  are  derived  either  by  using  phenomenological  or  by  quasistatistical 
arguments.  So  speculative  is  such  a  study  that  it  has  been  suggested  by  Heitler 
and  Janossy^  that  the  cross-section  for  the  multiple  production  of  the  mesons 
can  be  calculated  by  considering  the  colUsion  of  a  fast  particle  with  a  nucleus  as 
a  cascade  phenomenon  within  the  nucleus.  In  analogy  with  the  thickness  of  matter 
we  have  in  this  case  the  nuclear  matter  within  the  nucleus  and  the  diameter  of 
the  nucleus  corresponds  to  the  parameter  t  at  which  the  cascade  distribution  is 
studied.  This  merely  implies  that  we  carry  over  the  mathematics  of  the  cascade 
theory  to  the  computation  of  the  probability  jt  (71 1^0'  0  that  7i-mesonsare  produced 


1  W.  Heitler  and  L.  Janossy,  Proc.  Phys.  Soc.  A  62,  669  (1949). 
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by  an  incident  particle  of  energy  Eq  in  passing  through  a  nucleus  of  diameter  t. 
This  yields  the  cross-section  of  the  process  which  in  turn  has  to  be  used  in  studying 
the  cascade  development  in  matter. 

Let  R{E'  \E)dE'  be  the  cross-section  for  the  emission  of  a  meson  of  energy 
E  —  ^',  i.e.  the  incident  nucleon  drops  from  an  energy  E  to  an  energy  between  E' 
and  E'  +  dE'.  If  7i{n,  E  \Eq;  t)dE  is  the  joint  probability  that  a  nucleon  of  energy 
Eq  (initially  at  ^  =  0)  after  passing  through  a  distance  t  has  undergone  n  collisions 
and  has  an  energy  between  E  and  E  -\-  dE  the  diffusion  equation  can  be  written 
as 

dn  {n,  E\Eq;  t) 


dt 


=  -7z(n,  E\Eo;  t)  f R{E'\E)dE' 

0 

+  fnin-l,  E'\Eq;  t)  R{E\E')dE' .  (136) 

E 

Heitler  and  Janossy  made  the  following  simplifying  assumptions  that  the  mesons 
in  turn  do  not  produce  further  particles  and  that  R(E' \E)  =  0  if  E  <C  E^. 
Defining 

n^^^n,  E  \Eq;  t)  =  7r{n,  E  \Eq;  t)  for  E  <  E, 

7r(2)(^^  E  \Eq;  t)  =  7i(n,  E  \Eq',  t)  for  E  >  E^,  (137) 

the  equation  breaks  up  into  two  parts : 

0 

+  fn^^^n-l,  E'\Eo;  t)R{E\E')dE'  (138) 

E 

(I)  ^" 

dn^(n,E\EQ;  ^)_  f    (,)^^_j^  ^,|^^.  ^^  ji^E\E')  dE'  (139) 

assuming!  '  ^^^ 

(i)  R{E'  \E)dE'  is  homogeneous  and  can  be  expressed  as  i2l-=-] — =— . 
(ii)  the  distribution  of  the  initial  energy  Eq  is  given  by 

Ec  Y'"^  dEQ 


with 


^i^o)  =  y(^J    ^^  (140) 


/i^(^o)d^o=l. 


1  A.  Ramakrishnan,  Proc.  Phys.  Soc.  A  63,  861  (1950). 
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We  can  obtain  p^  the  probability  of  the  occurrence  of  n-mesons  irrespective  of 
the  energy  state  of  the  primary  nucleon.  Further,  the  probability  that  n  collisions 
have  occurred  and  the  energy  of  the  primary  is  reduced  to  a  value  below  the 
critical  energy  can  be  defined  as 

oo  Et 

A'^  =  lF{Eo)dEofn^'\n,  E\E^',  t)dE  (141) 

Ec  0 

and  p\f^  as  the  probability  that  n  collisions  have  occurred  and  the  energy  of  the 
primary  is  still  above  Ef., 

oo  Et 

p^^^-=fF{Eo)dEof7i^^Hn,  E\Eo',  t)dE  (142) 

and 

Pn  =  />if^  +  />i?^.  (143) 

On  performing  the  integration  over  E  and  Eq  we  obtain 


dt 

et 


=  -pt^B  +  p'^l^R,,,  (144) 

=  -P^^liR-P^^liRy.i  (145) 


where  R  is  the  integrated  cross-section  and 

E 


r     /E'\/  E'V^'^  dE' 


Solving  the  equations  (144)  and  (145) 

j'«'  =  -~?«"'^?*i  (1*7) 

pSJ)  =  r„  (B  t)  ijy;  1  (1  -By.,)-  (148) 

It  should  be  noted  that  integration  with  respect  to  the  initial  energy  spectrum 
has  an  important  mathematical  consequence  that  it  has  the  same  effect  of  re- 
ducing the  integrals  with  respect  to  energy  to  products  as  in  the  case  of  the  Mellin 
transform,  the  real  exponent  y  taking  the  place  of  the  complex  variable  5.  Of 
course  in  this  case  we  have  one  obvious  simplification  that  "the  spectral  trans- 
forms themselves  represent  the  original  functions"  while  in  the  MelUn  transform^ 
the  inverse  has  to  be  taken.  In  fact  in  the  calculation  of  mean  numbers  the  inte- 
gration can  be  performed  over  the  initial  spectrum.  This  simplification  by  the 
integration  over  initial  energy  spectrum  is  not  possible  for  product  densities  of 

E^ 
degree  two  since  such  integration  over  Eq  will  affect  both  the  variables  e^  =  -^ 

E 
and  £2  =  -T^  occurring  in  f^{E^,  E^  \Eq;  t). 
^0 
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3.  INTERACTIONS  AT  EXTREMELY  HIGH  ENERGIES^ 

The  quantitative  study  of  cascade  theory  has  till  now  been  confined  to  the 
longitudinal  development  of  electromagnetic  cascades.  Since  there  is  no  upper 
limit  to  the  possible  energy  of  the  particles  in  the  primary  cosmic  radiation, 
cascade  showers  can  be  initiated  by  particles  of  energy  much  greater  than  10^^  eV 
and  we  have  to  take  into  account  all  types  of  elementary  particle  interactions, 
as  well  as  the  lateral  spread  of  the  showers. 

Such  extensive  air  showers  (EAS)  initiated  by  primaries  which  are  invariably 
nucleons,  consist  of  mesons,  nucleons  and  strange  particles  produced  in  nuclear 
and  electromagnetic  collisions  at  high  energies.  It  has  been  found  experimentally 
that  the  particles  which  arrive  at  sea  level  are  spread  over  large  areas  of  the 
order  of  hundreds  of  metres.  The  lateral  spread  occurs  due  to  two  circumstances 
(i)  multiple  Coulomb -scattering,  (ii)  the  angular  distribution  of  particles  at  the 
point  of  production.  The  lateral  spread  of  the  electronic  component  is  mainly 
due  to  the  former. 

It  is  found  convenient  to  study  the  longitudinal  and  lateral  developments- 
separately.  Even  in  the  study  of  the  longitudinal  development,  with  our  inadequate 
knowledge  of  the  cross-sections  for  the  individual  processes  it  is  not  possible 
to  study  in  a  quantitative  way  the  particle  composition  of  cascades  in  finite 
thicknesses  of  matter.  However  it  is  possible  to  study  the  individual  reactions 
in  detail  in  "jet"  showers  in  nuclear  emulsion,  small  thicknesses  of  which  cor- 
respond to  large  distances  in  air.  As  regards  the  lateral  spread  we  shall  deal 
only  with  the  electromagnetic  cascades  which  will  give  an  idea  of  the  nature  and 
magnitude  of  the  problem. 

The  longitudinal  development  of  "jets" 

We  shall  outUne  the  general  features  of  the  "jet"  before  referring  to  the  present 
experimental  situation. 

(a)  The  shower  axis:  We  define  the  axis  of  a  shower  as  the  direction  of  the 
primary  which  of  course  is  isotropicaUy  distributed  in  space  (except  for  the  for- 
bidden directions  according  to  the  geomagnetic  theory).  Since  the  high  energy 
particles  are  extremely  collimated  we  can  speak  of  the  "core"  of  the  shower. 
The  shower  axis  in  the  emulsion  is  obtained  by  extrapolating  the  primary  track 
beyond  the  origin  of  the  jet  and  is  determined  as  follows :  If  e^  is  the  energy  of 
the  particle  making  an  angle  q)i  with  the  approximate  shower  axis,  a  large  number 
of  such  particles  are  chosen  such  that  ^e^  sin  (pi  equals  zero.  The  track  closest  to 

i 

the  shower  axis  may  be  chosen  as  the  reference  for  all  angular  measurements. 

(b)  The  nature  of  the  tracks :  The  charged  secondaries  are  classified  into  three 
groups  depending  on  the  grain  densities  of  the  tracks.  The  black  and  grey  tracks 


^  "The  extensive  air  showers  "  by  K.  Greisen,  Progress  in  Cosmic  Ray  Physics,  3,  3  (1956). 
Extensive  Air  Showers  by  William  Galbraith,  Butterworths  (1958). 


458 


ELEMENTARY  PARTICLES  AND  COSMIC  RAYS 


correspond  to  the  low  energy  protons  and  knock-on  particles  from  the  target 
nucleus.  The  minimum  tracks  correspond  to  the  shower  particles  produced  in 
nucleon-nucleon  collisions. 

(c)  The  angular  distribution:  It  has  been  found  that  most  of  the  secondaries 
emerge  in  a  narrow  cone  about  the  primary  axis,  a  circumstance  which  can  be 
explained  as  follows.  The  secondaries  produced  in  the  collision  of  a  stationary 
and  a  relativistic  nucleon  are  distributed  symmetrically  (but  for  statistical 
tiuctuations)  about  the  two  nucleons  in  the  centre  of  mass  system. 

The  angle  0  at  which  a  particle  emerges  in  the  laboratory  system  is  given  by 

1 


tan0  = 


V  sm 


y,(i;'cos6'  +  ft) 


with 


rc  = 


n-pi 


(1) 


where  v'  ,0'  are  the  velocity  and  angle  of  emission  in  the  centre  of  mass  system 
and  Pf.  the  relative  velocity  of  the  two  frames.  We  shall  hereafter  denote  the 
various  quantities  in  the  laboratory  (L)  and  centre  of  mass  (C)  system  by  un- 
primed  and  primed  variables.  It  can  be  shown  (see  appendix)  that  to  a  given 
angle  d  m  L  there  corresponds  two  angles  d[  and  d'^  in  C,  i.e.  there  are  two  velo- 
cities Vy^  and  Vg  in  L  at  the  angle  d  corresponding  to  the  same  velocity  v'  along  ©J 
and  ©2-  The  correspondence  between  the  various  angles  in  the  two  systems  is 
as  follows: 

For  a  given  velocity  v'  there  is  an  angle  dj^  such  that  the  two  angles  d[  and  d^ 
coincide  and  are  equal  to  d^  •  ^m  is  the  maximum  angle  of  emission  which  can  be 
assumed  in  L  by  a  particle  with  velocity  v'  in  C.  As  d'  varies  from  0  to  0^ ,  0  varies 
from  0  to  02tf  and  as  6'  increases  from  d'^  to  tt,  6  decreases  from  O'm  to  0.  Thus  all 
the  particles  are  emitted  within  a  cone  of  half  angle  Om  in  L.  Consider  now  the 
particles  emitted  in  the  forward  direction  in  O,  i.e.  0  <  0'  <  7il2 .  In  the  laboratory 
system  they  will  be  within  0  to  0 j.  where  dp  <C6m'  Let  62  =  O'c  be  the  angle  cor- 


71 


responding  to  0J  =  — 


.e.  particles  along  b'x  =  7r/2  or  02  =  ^c  will  emerge  along 


62?  in  L.  The  particles  emitted  between  0{=  n\^  and  b'^  will  lie  between  0^  and  b^ , 
those  between  d'^  and  0^  between  0^  and  d^  and  those  between  0^  and  n  between 
dp  and  0.  The  particles  emitted  in  the  backward  direction  in  the  C  system  fall 


n 


into  two  classes.  Those  with  -^^6^  <6c  will  lie  between  dp  ^6  ^6^  while  those 
with  d[^d'  ^n  lie  in  the  same  forward  cone  0  ^  0  ^  ^2^  as  the  particles  in  the 


Fig.  65.  Correspondence  between  the  angles  in  the  cm.  and  laboratory  systems. 
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forward  direction.  However  these  two  sets  of  particles  can  be  distinguished  by 
their  velocities  since  the  velocity  corresponding  to  62  is  always  smaller  than  that 
for  d{ .  Experimentally  the  cones  of  half  angles  6p  and  Om  can  be  easily  identified. 
The  degree  of  anisotropy  in  the  centre  of  mass  system  is  reflected  on  the  density 
of  particles  within  the  cone  of  angle  dp .  We  summarize  below  the  correspondence 
between  the  angles  in  the  two  frames. 


Centre  of  mass  system 

Laboratory  system 

0^6'^  nr2 

0^6  ^Of 

Y^^'^^M 

OF^e^dM 

e^M^o'^d', 

6m^6^  6f 

d'.^d'^Jt 

dF^o^e 

The  approximate  value  of  Op  in  radians 

;  is  given  by 

e^^ 

-f 

iM^ 

Ep 

where  the  nucleon  mass  Mp  and  the  primary  energy  Ep  are  expressed  in  BeV. 


(d)  Determination  of  energies:  The  energy  of  the  primary  particle  cannot  be 
measured  directly.  It  is  usually  estimated  from  the  approximate  relation 

2M^         2 


^P^-^^-zr     BeV.  (2) 


'F  ^F 

This  relation  is  valid  only  for  nucleon-nucleon  collisions  with  isotropic  distribution 
of  secondaries  in  the  centre  of  mass  system. 

In  the  case  of  a  nucleon-nucleon  collision  the  total  energy  W  is  given  by 

Jf  =  2y?if^.  (3) 

A  fairly  accurate  method  of  determining  the  energy  is  through  y^ .  The  two  methods 
of  obtaining  y^  are  given  below. 

(i)  If  ni  and  nf  are  the  number  of  forward  and  backward  shower  tracks  (which 
can  be  identified  in  the  laboratory  system  by  separating  out  the  cones)  and  ^f , 
and  99^^  the  angles  of  the  individual  tracks  in  the  laboratory  system  then 

logy,  =  ——  2]  logtan99/,.  +  ——  J]  logtan^o^,..  (4) 

^  n*     i  Li  rtff     J 

(ii)  If  E  and  E'  are  the  energies  of  the  shower  particles  in  the  two  system 
and  (pi  the  space  angles  between  the  track  and  the  forward  direction  we  have 
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The  plot  of  —  against  tan^o?,.  is  a  straight  line  when  all  the  secondaries  are  emitted 

^  1 

with  the  same  energy.  The  intercept  on  the  E  axis  is  ~ ^and  the  slope  iaydE'^ 

(e)  The  transverse  momentum^:  Besides  energy,  the  transverse  momentum  p^ 
of  the  particle  measured  in  a  direction  perpendicular  to  the  line  of  flight  of  the 
primary  is  an  important  characteristic  of  the  particle.  It  was  first  noted  by  Nishi- 
mura  that  p^  is  almost  independent  of  the  energy  of  the  primary,  for  example, 
for  pions  p^  ^  0.5  BeV/c.  Since  the  perpendicular  component  is  unaffected  by 
the  Lorentz  transformation  the  value  of  p^  is  identical  in  the  two  systems.  The 
average  p_^  of  the  y  rays  was  expected  to  be  half  that  of  the  parent  pion.  However 
the  transverse  momentum  of  the  y  rays  measured  in  a  direction  perpendicular 
to  the  direction  of  the  pion  was  found  to  be  very  small  which  can  be  neglected 
in  comparison  with  that  of  the  pion.  The  mean  value  of  p^  for  non-pions  is  found 
to  be  of  the  order  of  1 .5  BeV. 

An  approximate  method  of  measuring  the  energy  is  to  assume  the  constancy 
of  p^.  The  measurement  of  angular  distribution  in  the  centre  of  mass  system  will 
yield  the  total  energy  of  the  secondary  particles.  The  laboratory  angular  distri- 
bution can  be  easily  measured  as  most  of  the  secondaries  are  pions. 

(f)  Inelasticity :  The  inelasticity  /  of  a  process  is  the  fraction  of  the  total  kinetic 
energy  of  the  nucleons  in  L  transferred  to  the  newly  created  particles.  That  it 
is  not  possible  for  two  colliding  particles  to  lose  all  their  kinetic  energy  in  iy  is- 
clear  from  the  following  argument.  In  the  C-system  the  total  momentum  is 
zero  and  both  the  particles  can  be  brought  to  rest.  This  corresponds  to  the 
maximum  transfer  of  energy  to  the  newly  created  particles.  Observing  the  col- 
lision in  the  Z-system  the  nucleons  will  have  a  velocity  the  magnitude  of  which, 
is  equal  to  that  of  the  C-system.  The  value  of  I^^j.  is  given  by 

(6) 


rc+1 

We  briefly  summarize  the  various  methods  of  determining  /. 

(1)  /is  given  by  the  relation 

i 

where  Si  and  e  are  the  energies  of  the  i-th  and  the  primary  particles  respectively. 
Experiments  in  air  showers  indicate  that  a  primary  may  retain  up  to  80  per  cent 
of  its  energy,  i.e.  I  ^  0.2. 

(2)  /  can  also  be  determined  from  measuring  the  ratio  AattMhit  where  the 
attenuation  length  Aatt  is  defined  as  the  distance  in  which  the  number  of  stable 
particles  above  energy  Eq  (with  a  spectral  energy  distribution)  falls  off  by  a 

factor  — .  The  ratio  S  =  AattMint  is  always  ^  1,  the  equality  holding  only  when  all 


1  See  S.  Hasegawa,  K.  Niu,  Institute  for  Nuclear  Study  Reports  (1958). 
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the  interactions  result  in  catastrophic  absorption.  The  extent  to  which  S  exceeds 
unity  depends  on  /.  Experiments  with  primaries  of  energy  r^  100  BeV  yield  a 
value  of  /  ^  0.5  while  at  10'  BeV,  /  ^  0.3. 

(3)  A  third  method  of  determining  I  from  individual  events  is  by  Daniel^  et  al. 
The  quantity  ^ 

^=^n(i -Acose)  =  ifn/(^)  (8) 


im 


was  determined  for  each  track  in  the  outer  fringe.  Here  M  is  the  ratio  of  the  rest 
mass  of  a  secondary  to  that  of  a  proton,  e  the  ratio  of  the  total  energy  of  a  second- 
ary to  the  average  energy  available  for  each  secondary  and  ysl^Tc  ^^^  ratio  of 
the  energy  of  the  particle  to  the  total  available  energy  in  the  centre  of  mass 
system.  The  mean  value  of  e  was  obtained  as  0.65  ±  0.20  for  primaries  of  energy 
>  100  BeV.  This  is  consistent  with  the  view  that  about  a  third  of  the  available 
energy  is  carried  by  neutral  particles. 

(4)  Another  method  of  determining  /  is  to  measure  the  maximum  angle  of 
emission  of  secondary  particles  in  the  laboratory  system.  For  yg  <  y^  there 
must  exist  a  maximum  value  of  6  corresponding  to  cos  Q'  =  ^  PsIPc  such  that 
■yg  :^  yc  sin  6^max  •  ^  is  evaluated  by  assuming  that  most  of  the  secondary  particles 
are  pions.  This  method  holds  only  if 

(i)  all  the  secondary  particles  are  emitted  with  the  same  yg , 

(ii)  ^max  ^^^  ^®  measured  without  statistical  error. 

However  such  conditions  do  not  hold  in  practice.  The  accurate  determination 
of  /  requires  either  the  direct  measurement  of  angle  and  energy  or  some  extra 
condition  such  as  the  constancy  of  p^^  besides  the  identity  of  particles. 

The  lateral  spread  of  the  extensive  air  showers  (EAS) 

As  stated  earlier  we  shall  deal  only  with  the  lateral  spread  of  the  electrons 
in  electromagnetic  cascades.  Similar  considerations  apply  to  other  types  of 
showers. 

In  any  quantitative  study  we  are  interested  in  the  average  number  of  particles 
in  the  energy  range  between  E  and  E  -\-  dE  at  thickness  t.  Let  t  be  chosen  as  the 
^-axis  and  r  the  radial  distance  in  the  xy-plane.  Let  the  particle  move  in  a  direc- 
tion making  an  angle  between  0  and  6  +  dd  with  the  z-axis.  6^  and  6y  are  the  angles 
which  the  projections  of  the  particle  direction  in  the  xz-  and  yz-iglanes  make  with 
the  z-axis.  Thus  the  number  of  particles  can  be  denoted  by  7i{E,  r,  d;  t)dEdrdO 
and  when  there  are  more  than  one  type  of  particle,  similar  functions  can  be 
"defined  for  each  of  them.  To  obtain  these  functions  their  variation  with  t  will 
have  to  be  studied.  We  have  already  discussed  the  situation  in  great  detail  when 
the  distribution  in  r  and  6  have  not  been  taken  into  account.  We  include  this  by 
defining  a{(p)dq)  to  be  the  probability  that  an  electron  is  deflected  at  an  angle 
(cp,  d(p)  with  its  original  direction  in  travelling  a  thickness  d^.  a{(p)  is  an  even 


I  ^  R.  R.  Daniel  et  al.,  Bristol  Conference  on  Hyperons  and  Heavy  Mesons  (1951). 
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function  as  the  scattering  should  by  symmetric  about  the  original  direction. 
For  scattering  due  to  a  Coulomb-field  we  take 

where 

Es  =  2;r(137)V.me  =  21  MeV.  (10) 

Then  the  variation  in  the  number  of  particles  within  an  interval  6  and  B  +  dO 
caused  by  scattering  is  given  by : 

-7i{E,  r,  6;  t)  f  o  (cp)  d(p -\-  ja{q)-0)n{E,  r,  q?',  t)d(p.  (11) 

There  is  also  a  deterministic  lateral  (spatial)  displacement  Odt  and  this  implies 
that  particles  with  coordinates  r  —  Odt  at  t  move  to  r  at  t  +  dt,  i.e. 

7i{r,e',t  +  dt)  =  7i{r  —  ddt,e;t).  (12) 

The  change  in  the  number  density  at  r  due  to  variation  d^  assuming  only  this 
process  is  given  by 

.i(r,  6,  t  +  dt)^7i{r-ddt,  d;  t)=n{r,  0,  t)-ddt^^'^'  ^'  ^\        (13) 

or 

Expanding  7i{E,  r,  6  +  9?,  <)  as  a  Taylor  series  in  0,  we  obtain  (11)  as: 

-n{E,  r,  6;  t)JG{(p)d(p+  [/{:7r(^,  r,  0;  t)  +  (p^Qn{E,  r,  0;  i) 

+  ^^l7i{E,  r,  6;  0  +  •  •  •}  a((^)  d(^j .  (14) 

The  second  term  within  the  square  bracket  vanishes  as  a  is  an  even  function. 

At  this  stage,  we  make  the  following  approximation.  The  mean  square  angle  of 

scattering  is  given  by  r  /  E  \^ 

j  9.^ <!(,>)  dy  =  i(-^)  .  (15) 

All  higher  moments  of  99  are  neglected.  This  implies  that  o{(p)  rapidly  tends  to 
zero  as  (p  differs  from  zero,  i.e.  the  scattering  is  confined  to  an  extremely  small 
angle.  This  approximation  is  necessary  since  even  the  cumulative  angular  devia- 
tion 6  is  considered  small  enough  such  that  sin  0  =  0,  i.e.  the  path  length  t  tra- 
versed is  identical  with  the  z- coordinate.  If  this  approximation  is  not  made  finite 
values  of  q)  will  be  allowed  and  therefore  6  can  take  values  greater  than  7r/2  which 
implies  the  possibility  of  back  scattering.  The  process  will  not  then  be  Markovian 
with  respect  to  z  and  considerable  mathematical  difficulties  are  introduced  which 
are  not  yet  solved  even  from  a  formal  stochastic  point  of  view.^ 


^  This  point  does  not  seem  to  have  been  realised  by  Kamata  and  Nishimura  who  believe 
that  the  Landau  approximation  can  be  improved  by  replacing  d  by  sin  6.  Therefore  we  are 
not  giving  an  account  of  their  treatment  when  the  approximation  is  not  made. 
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With  the  above  approximation,  the  diffusion  equations  for  a  three-dimensional 
electron-photon  cascade  can  be  written  as: 

f{E,  r,d\EQ\t)  and  g{E,  r,  d\EQ\  t)  are  the  electron  and  photon  densities  respec- 
tively and  A',  B',C'  and  D  symbolically  refer  to  the  corresponding  expressions 
in  the  electron-photon  cascade  equations  (30)  and  (31).  If  we  take  the  Mellin 
transform  of  the  /  and  g  functions  with  respect  to  E  we  obtain  A{s),  B(s),  C{s) 
and  D  respectively.  The  above  set  of  equations  are  referred  to  as  the  Landau^ 
equations.  It  is  easily  seen  that  the  introduction  of  the  distribution  in  r  and  d 

1^2  Pi  4  pj  ■( 

has  resulted  in  the  additional  term— — ^  A^f,  Q^— — and  d,.— — in  the  diffusion 

equations.  A  general  solution  of  these  equations  has  been  obtained  by  Nishimura 
and  Kamata^  by  an  extension  of  the  method  due  to  Bhabha  and  Chakrabarthy.^ 
To  solve  these  equations  we  make  use  of  the  standard  Fourier  transform  technique 
and  define  oo 

p'(E,  n,  i\Eo>  «)  =  1^  ///(^'  "■'  «l^o;  t)e''>-'*'--'drAd         (18) 

-CX)     0 

oo 

q'{E,  ru  ^\Eo;  t)  =  ^  J  J^^"^'  "*'  ^i^«'  t)  e'^'-^^'' dr  dd         (19) 


rj  and  |  have  two  components  rj^ ,  r]2  ^^^  ^i  >  ^2  respectively. 
In  terms  of  p'  and  q',  equations  (16)  and  (17)  take  the  form 

dp' 

Je 


(20) 


Eliminating  q'  between  the  two  equations  we  have 

1  L.  Landau,  J.  Phys.,  3  237  (1940).  See  also  G.  Moliere's  article  in  Cosmic  Radiation, 
Ed.  W.  Heisenberg,  Dover  (1943). 

2  J.  Kamata  and  K.  Nishimura,  Progr.  Theoret.  Phys.  Supp.,  6,  135  (1958).  This  paper 
contains  an  extensive  reference  to  the  work  on  the  subject. 

3  H.  J.  Bhabha  and  S.  K.  Chakrabarthy,  Phys.  Rev.  74,  1352  (1948). 
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Radial  structure  functions 

The  number  of  particles  between  r  and  r  +  dr  is  given  by  2nr6.rf  (E,  r\EQ\  t) 
where  oo 


(23) 


(24) 


f'^{E,  r\Eo\  t)=  j  (E,  r,  B\E^\  t)2nede 

-oo 

Substituting  for  /  in  the  integral  in  terms  of  p' {E,r),  ^\Eq;  t)  we  get 

oo 

f^HE,  r\Eo\  t)  =  j  E-^'^'-p'iE,  ^,  0\Eo\  t)dr] 

-  00 

oo 

=  fp^'^E,  Tj,  0\Eo;  t)Jo{rir)27iridr) 

0 

where  Jq  is  the  Bessel  function  and  tj  =  j/(r/j  +  rj^)  and  r  =■■  ^(x^  +  y^). 

Thus  we  need  only  p' iE,r),0\EQ;  t)  to  obtain  the  lateral  structure  function 
f^HE,  r  \Eq;  t).  However  p'  (E,  rj,  0  \Eq',  t)  cannot  be  obtained  merely  by  setting  ^ 
equal  to  zero  because  the  derivative  of  ^'  with  respect  to  |  occurs  in  the  equation. 
Assuming  the  vector  ^  to  be  parallel  to  rj  the  equation  for  p'  (E,  ^,  li ,  ^2  =  ^  l^o*'  0 

=(^-''i*'')(-S-«-''ii)'''«--»>^      '-' 

Defining  two  new  variables  (^  and  t  by  ^  =  ^(1  —  ^')  a-nd  t  =  t^  the  equation  (25) 
becomes  .0  o  ^ 

^^^^A'^D)^  +  (A'D-B'C')]p' 

Defining  i^  as  a  Mellin  transform  of  p^  with  respect  to  E  with  5  as  a  complex 

argument,  the  solution  of  the  above  equation  in  terms  of  the  powers  of     ^^ 
and  ^jE  is  (on  writing  t'  =--  t) 

where  <Pm,n  is  defined  through  the  equation 

102  p\ 

—^  +  (A{s  +2m  +  n)  +  D)—-  +  (A{s  +  2m  +  n)  D  -  B{s  +  2m  +  n) 

xC{s  +  2m  +  n))0m,n 
=  (-^  +  i)){(C-<)^^„.-i,n+(5  +  2m  +  7i)0^.,_i}]  (28) 
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with 


and  0Q^  0  satisfies 

a 


+  {A{s)  +  D}-^  +  A{s)  D-B{s)  C{s)]0o,o-O  (30) 

with  the  boundary  conditions 


d 


^o.o(^,  0)  =  1;  ^^o,o(^.  t)\,^o=-^{s)  (31) 

corresponding  to  a  shower  initiated  by  an  electron  of  energy  Eq  .  0q  q  is  given  by 

^0,0  =  ^1  (*)  e'^^*^'  +  ^2 (<5)  e'^^*)^  (32) 

where 

/l(s)  -A2(5)  Al(5)-;i2(s) 

^1  (^)  =-  K-  (^  (^)  +  D)  +  {(A  is)  -  Df  +  4.B  [s)  C  {s)}^  (34) 

and 

}^(a)  =  \[-(A (s)  +  D)-  {(A (s)  -  Df  +  iB(s)C («)}'/.] .  (35) 

The  solution  of  (28)  is  given  by  the  recurrence  formula 

t 

+  (5  +  2m  +  7i)0^,„_i(5,  ^-t',  t')]dit'.         (36) 

The  solution  'p^^'>  has  been  obtained  as  a  double  infinite  series  involving  the  sum- 
mation over  the  discrete  variables  m  and  n.  Kamata  and  Nishimura^  have  ex- 
pressed this  as  integrals  over  the  continuous  complex  variables  u  and  v  by  first 
defining  the  function 

and  recognizing  that 

J5;2^2xm 


(  =  n  n  =  0  \         ^J^      J 


=   e-'^^^-j^-oi' ^IEF{oc,  <x',  s,  t)docdoL'         (38) 

0 

where  F{ix>(x',  s,  t)  is  a  function  which  has  the  Mellin  transform  M-i^{u,  v,  s,  t). 
The  function  M^(u,  v,  s,  t)  is  obtained  by  merely  replacing  m  and  n  by  the  two 
complex  variables  u  and  v  respectively.  The  double  series  is  uniformly  and  absc- 


1  J.  Kamata  and  K.  Nishimura,  Ibid.  (1958). 
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lut«ly  convergent  for  E  > r and  E>Spt.  M^  satisfies  the  same  equation  (28) 

as  ^m.n-  Thus  changing  the  order  of  integration  and  inverting  we  get 

l,/......(A)'i(A)-(|)-(-)-- 

r^u  +  \)  r{-v)  M^{u,  V,  8,  t) .  (39) 


8;i* 


-too 


This  solution  for  the  lateral  structure  function  is  valid  irrespective  of  the  re- 
strictions on  E.  The  lateral  structure  function  for  particles  above  energy  E  is 
given  by 

oo 

r{E,  r\Eo,  t)  =  f  f'^HE,  r\Eo',  t)dE 

H 

871* i   J      s  +  2u  +  v\E)\E,l\El\Eil  '     ' 

-  JOO 

X  r{u  + 1)  r(-  v)  jf  1  {u,  V,  s,  t) . 

At  the  limit  E  =-  0,  the  integration  with  respect  to  v  is  equal  to  the  residues  of 
the  integrands  atv=  — 5  — 2u  and  we  obtain  the  structure  function  for  the  total 
number  of  particles  as 


-  »oo 

X  r{s  +  2u)  Mi{u,  -s-2u,  s,  t) .  (41) 

The  total  number  of  electrons  is  obtained  by  integrating  over  r  in  the  above 
expression 

oo 

J P' (0,  rlE,;t)27trdr  =  ^/d« (^)'  r(s)  M, (0,  - s,  s,  t) .         (42) 

0 

Angular  structure  functions 

The  angular  structure  function  /^^^  {E,  d\EQ;  t)  for  electrons  is  given  by 

+  00 

riE,  e\Eo;t)=  f  f{E,r,  0\Eo;  t)2nrdT 

-oo 
oo 

=  fp' (6;^)  T^ii,  6)^71  idi.  '  (43) 

0 

The  solutions  /<^^  and  /<®^  are  obtained  in  a  manner  similar  to  that  of /^^^  and  /^*"^ 
These  expressions  can  be  obtained  by  replacing  r  by  6  and  M  ^  {u,  v,  s,  t)  by 
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M2{u,  V,  s,  t)  where  M2  is  a  solution  of  the  equation 


d^lde+  \A{s  +  2u  +  v)^D 


—-  +  A{s  +  2u  +  v)  D  -  B{s  +  2u  +  v] 
ot 


C{s  +  2u  +  v) 


M^iu,  V,  s;  t) 


1—+d]{uM2{u-  I,  V,  s;  t)  +  {s  +  2u  +  v)vM2{u,  v-\,  s;  t)}  (44) 


with 


Jfa(0,  0,  s;  t)  =  H^{s)  e^i^*^^  +  HM  e^^^'^K 


(45) 


It  is  to  be  noted  that  (44)  differs  from  the  equation  for  M^ .  M^  was  obtained  as 
a  limit  while  M^  is  defined  directly,  i.e. 

M^{u,  V,  s,  t)^  ^^^  ^  M{u,  V,  s,t,C- 1)  (46) 

The  approximate  values  of  the  structure  functions  have  been  obtained  by  the 
saddle-point  integration  with  respect  to  s. 


(1)  Approximation  A,  i.e.  on  neglecting  the  ionization  loss  ^  we  h§;ve 

1  „         „  2 


fin 


s-'U 


for     2 


>0 


(47) 


f^^  /-^  constant  for  2  —  s  —  "^  <  0  where  the  saddle  point  s  is  defined  by 


a  Aj  (s)  t 


ds 


>'-l-^-(fl-' 


(ii)  Approximation  B,  i.e.  on  taking  ^5  to  be  a  constant,  we  have 

1 


[r) 


for     2  -  s  >  0 


with 


d^i{s)t 


constant  for 


2-  s<0 


Pr 


>-t-'-(x)r»- 


(48) 


(49) 


(50) 


At  the  limit  Eq<- 00  the  s's  defined  above  coincide  with  what  is  termed  "the 
shower  age"  in  the  one-dimensional  cascade  theory. 

Attempts  have  also  been  made  to  obtain  lateral  distribution  function  curves 
from  empirical  considerations.  For  example,  Bethe^  has  suggested  the  radial 
distribution  function  which  represents  the  Moliere  distribution  for  5=1 


(p{r)  =  45r  -^(1  +  4r)  exp(— 4//" 


(51) 


^  H.  A.  Bethe,  see  p.  29,  Extensive  Air  Showers,  W.  Galbraith,  Butterworths  (1958). 
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r  being  measured  in  units  of  It^ ,  the  lateral  distance  through  which  an  electron 
of  energy  E^  is  scattered  in  traversing  a  radiation  length.  The  density  of  particles 
at  r  i8  then  given  by  ^  (,)  ^  ^^  ^^yj^t  ^52) 

where  N  is  the  total  number  of  particles  in  the  shower.  Another  empirical  distri- 
bution due  to  Greisen^  is  of  the  form 


(p(r)  =  c(s)r^-^{r  +  l)'--**. 


(53) 


We  have  till  now  discussed  only  the  lateral  distribution  of  soft  showers.  For 
reasons  repeatedly  mentioned,  the  corresponding  distributions  for  neutrons  and 
mesons  cannot  be  quantitatively  estimated  though  Rosenthal^  has  made  some 
calculations  using  the  Fermi  model^  for  high  energy  collisions.  His  results  are 
summarized  below: 


Height 
above  sea  level 

-^el-ph 

Hy 

^n 

^, 

R, 

3  km 
Sea  level 

100 
70 
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80 
90 

400 
600 

180  meters 
200  meters 

The  quantity  i?ei-ph  is  the  mean  square  radius  of  electron  based  on  pure  photon- 
electron  cascade  theory.  The  other  quantities  are  derived  from  nucleon  cascade 
models  and  refer  respectively  to  iV^,  jt,  ju  and  e.  It  is  to  be  noted  that  the  distribution 
of  electrons  is  largei  than  that  given  by  the  cascade  theory  and  the  /^-meson 
distribution  falls  off  less  steeply  than  that  for  any  of  the  other  components. 

The  experiments  have  shown  that  the  N  component  falls  off  in  a  similar  manner 
to  that  of  electrons  near  the  shower  axis  while  for  ^-mesons  the  fall-off  is  more 
gradual.  For  example  at  500  m  from  the  axis  ^-mesons  constitute  half  the  number 
of  shower  particles.  In  all  experiments  it  has  become  customary  to  determine  the 
relative  abundance  of  ^  for  the  various  distances  from  the  shower  axis.  The  lateral 
distribution  q  (r)  fits  the  equation 


Q{r) 


(54) 


71  =  1.6  ±  0.3  for  showers  containing  10'  particles 
n  =  2.0  ±  0.4  for  showers  containing  10®  particles. 

Despite  the  inadequacies  and  limitations  in  the  treatment  of  the  lateral  distribu- 
tion, considerable  experimental  work  is  being  done  to  study  the  density  distribution 
of  the  particles  in  the  shower  irrespective  of  their  nature.  The  most  important 
conclusion  is  that  the  extensive  showers  have  a  single  core  and  the  Moliere 
distribution  fairly  fits  the  data.  However  in  view  of  the  suggestion  of  Lewis, 
Oppenheimer  and  Wouthuysen*  regarding  the  multiple  production  of  mesons  in 


1  K.  Greisen,  Phys.  Rev.  75,  1071  (1949). 

2  I.  L.  Rosenthal,  J.  Expt.  Theoret.  Phys.  23,  440  (1952). 

3  E.  Fermi,  Progr.  Theoret.  Phys.  5,  570  (1951);  Phys.  Rev.  81,  683  (1951). 
^  H.  W.  Lewis  et  al,  Phys.  Rev.  73,  127  (1947). 
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nuclear  collisions  at  narrow  angles,  attempts  are  made  to  determine  whether 
there  are  multiple  cores  corresponding  to  the  sheaf  of  particles. 

Another  interesting  feature  is  that  the  lateral  structure  is  the  same  for  all 
showers,  that  is,  the  particle  density  is  proportional  to  the  total  number  of 
particles.  This  also  implies  that  if  the  total  number  of  particles  has  a  distribution 
function,  the  density  of  particles  has  the  same  distribution  at  any  point. 

As  has  been  previously  emphasized  one  of  the  important  aims  in  the  study  of 
extensive  showers  is  the  determination  of  the  energy  spectrum  of  the  primary 
particle.  Olbert^  has  made  some  estimates  of  the  primary  energy  from  the  number 
and  the  density  distribution  of  the  air  showers  using  the  Fermi ^  and  Landau* 
model  for  nuclear  collisions.  He  obtained  a  relationship  between  the  total  number 
of  particles  and  energy  as : 

iV  =  6  X  10-12  ^^11  using  the  Fermi  model,  and 

N  =  5  X  10-12  ^^1-6  based  on  the  Landau  model  (55) 

for  air  showers  at  sea  level.  For  a  shower  of  10^  particles,  Olbert's  estimate  of 
the  primary  energy  from  the  first  equation  would  be  3  x  10^^  eV  and  from  the 
second  5  x  10^^  eV.  Thus  the  primary  energy  is  not  sensibly  dependent  upon  the 
model  assumed  for  the  high  energy  nuclear  interactions.  Therefore  if  we  are  able 
to  determine  the  number  spectra,  this  yields  the  initial  energy  spectra.  Using 
these  the  following  result  was  obtained  by  Clarke* 

/  101^  \2-i3 
n{>  Eq)  -  2.2  X  lO-^M -— 1      cm-2  sec-i  sterad-i  (56) 

for  the  energy  region  10^^  eV  —  10^®  eV. 

4.  ENERGY  BALANCE  OF  COSMIC  RADIATION^ 

In  the  study  of  cosmic  ray  processes  we  have  assumed  only  the  known  inter- 
actions of  elementary  particles.  Since  cosmic  radiation  consists  of  extremely  high 
energy  particles  it  is  considered  worthwhile  to  investigate  whether  the  dissipation 
of  energy  of  cosmic  rays  can  be  accounted  for  by  assuming  only  the  known  inter- 
actions or  we  have  to  invoke  new  elementary  processes  hitherto  unknown.  For 
this  purpose  we  shall  calculate  the  energy  balance  after  allowing  for  the  dissipation 
through  known  processes.  For  such  a  calculation  we  need: 

(a)  The  composition  and  intensity  of  primary  radiation  at  various  latitudes 
(the  reference  latitude  is  taken  by  Puppi  to  be  50°  N).  This  helps  us  to  calculate 
the  energy  flux  of  cosmic  rays  as  they  enter  the  atmosphere  which  can  be  com- 


1  S.  Olbert,  see  G.  Clark  etal.,  see  also  S.  Olbert  and  R.  Stora,  Ann.  Phys.  1,  247  (1947). 

2  E.  Fermi,  loc.  cit. 

^  L.  Landau,  loc.  cit. 

*  G.  Clarke  et  ah.  Nature,  Lond.  180,  353,  406  (1957). 

^  This  section  is  based  on  the  article,  "The  energy  balance  of  cosmic  radiation"   by 
G.  Puppi,  Progress  in  Cosmic  Ray  Physics,  3,  341  (1956). 
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pared  with  the  energy  dissipated  in  the  atmosphere  and  underground  by  the 
various  components, 

(b)  The  nature  and  intensity  throughout  the  atmosphere  and  at  sea  level  of 
the  various  secondary  components  by  the  primary  radiation, 

(c)  The  exact  modes  in  which  the  primary  radiation  generates  the  secondary 
components  and  manner  in  which  the  various  secondary  components  transform 
into  each  other  or  are  absorbed. 

We  have  described  earlier  in  a  qualitative  way  the  nature  of  inelastic  collisions 
of  the  primary  particle  with  the  nuclei  and  the  consequent  secondary  products. 
The  charged  and  neutral  pions  and  any  type  of  heavy  meson  or  hyperon  cannot 
constitute  a  component  with  marked  intensity  anywhere  in  the  atmosphere 
because  of  their  short  mean  lifetime.  Thus  in  the  study  of  energy  balance,  by 
secondary  particles  we  mean  only  ^-mesons,  electrons,  photons  and  nucleons. 
We  will  now  take  as  the  basis  of  our  discussion  the  information  on  the  behaviour 
of  these  components  with  respect  to  latitude.  We  will  assume  also  that  the  primary 
component  consists  mainly  of  protons  and  neutrons,  neglecting  the  very  small 
fraction  of  heavy  nuclei. 

The  /Li-mesonic  component 

It  is  clear  that  we  must  possess  an  accurate  knowledge  of  the  spectrum  of  the 
//-mesons  at  some  depth  so  that  the  behaviour  of  the  //-mesons  with  depth  will 
yield  the  7r-meson  spectrum,  which  is  the  generation  spectrum  of  the  ^-mesons. 
It  will  be  seen  presently  that  the  generation  spectrum  is  an  important  quantity 
in  computing  fhe  energy  balance.  The  //-meson  spectrum  at  sea-level  is  one  of 
the  best  determined  quantities  in  the  study  of  cosmic  rays.  The  present  data  on 
the  range  spectrum  of  mesons  compiled  from  the  underground  and  the  magnetic 
spectrograph  measurements  can  be  converted  to  yield  the  differential  energy 
spectrum  at  50°  N  shown  below. 

From  this,  the  total  density  of  muons  of  any  energy  is  shown  to  be 

0.89  X  10-2  cm-2  sec-i  sterad-i 
and  the  total  energy  contained  in  the  spectrum  is  given  by 

38  MeV  cm~2  sec~^  sterad"^. 

In  order  to  arrive  at  the  generation  spectrum  of  the  //-component,  let  us  assume 
that  all  the  muons  originate  from  the  decay  of  pions  in  flight  and  that  these 
TT-mesons  are  produced  by  a  primary  nucleonic  component,  the  intensity  of 
which  decays  exponentially,  i.e.  the  production  function  of  tt's  can  be  written 
as  follows 

g„{E;t)==g„{E)^e-^l^-  (1) 

where  g„{E)  represents  the  generation  function  of  the  :7r- mesons  and  L„  their 
production  length.  The  diffusion  equations  for  7t{E;  t)dE  and  fj,{E;  t)dE  which 
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describe  the  differential  energy  spectra  of  pions  and  muons  are 

— /4(^;  t)=^{^^ti(e;  t))  -  R^{s)  fi{e;  t)  +  R„(K  e)n(K  e;  t)  (2) 

^n(EM)  =  ~{^„n(E;t))-[Q  +  R„(E)\7t(E;t)  (3) 
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Fig.  66.  Height  intensity  curves  for  the  various  components  at  30°  N. 


where  R^(E)dit  is  the  probability  that  the //-meson  decays  in  thickness  d^,i2^(ire)  d^ 
the  probability  that  a  pion  of  energy  Ke  produces  a  muon  of  energy  e,  q  is  the 
cross- section  for  the  inelastic  collision  for  the  pions. 

If  we  confine  ourselves  to  the  high  energy  //-mesons  at  sea  level  (i.e.  energy 
>  1  GeV)  we  can  assume  the  ionization  losses  to  be  constant.  It  can  be  shown 
that 
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where  b  =  w^,c/t,,  -  Hq,  B  = 


m„c 


Hq  and  the  parameters  A  and  y  can  be  determined 


from  comparison  with  spectrum  at  sea  level.  We  have  assumed  that  it  is  possible 
to  represent  the  pion  production  spectrum  as  follows 

yA 
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Fig.  67.  Number  of  particles  in  shower  at  different  altitudes  in  terms  of  the  number  at 

sea-level. 


Thus  we  conclude  that  the  n  production  spectrum  generating  mesons  of  energies 
>  1  GeV  is  23 

g„{E)  =  —^:^  (GeV)-i  cm'^  sec    ^  sterad"!.  (6) 

In  writing  down  the  last  equation  we  have  implicity  assumed  that  ^-mesons 
originate  only  from  the  decay  of  tis,.  We  will  not  here  consider  the  influence  of 
the  contribution  due  to  the  decay  of  K  particles  into  /^-mesons.  The  total  energy 
contained  in  the  production  spectrum  at  50°  N  amounts  to 

366  MeV  cm-^  sec-i  sterad-i, 

i.e.  this  amount  of  energy  goes  into  the  production  of  //-mesons  at  50°  N.  On 
multiplying  the  original  diffusion  equation  (3)  by  energy  e  and  integrating  over 
all  energies  and  over  the  whole  of  the  atmosphere  we  obtain : 

<X  (X  t^  IX 

feG{e)de=  f  efi{e\  gde  +  /d<  /  ^{e)'f^{e;  t)  de 


rrific* 


mfic 


^0  "  -0 

+  fdt  f  E{8)iLi{e;  t)  ds  +  J m^c^ ^{m^c^)  fi{m^c^;  t)  dt         (7) 


rn/iC 


where  it  has   been   assumed   that   G  {e,  A)   is  the  generating   function  for  the 
//-meson  irrespective  of  its  production  mechanism.   While  the  left  hand  side 
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represents  the  total  energy  contained  in  the  generation  spectrum,  i.e.  the  total 
energy  transferred  to  the  //-mesons,  the  terms  on  the  right  hand  side  represent 
the  various  ways  in  which  energy  is  dissipated  in  the  atmosphere,  the  first  giving 
the  residual  energy  at  sea  level,  the  second  referring  to  the  energy  loss  through 
ionization  and  electromagnetic  interactions,  the  third  and  fourth  giving  the 
energy  which  disappears  with  the  //-mesons  which  decay  in  flight  and  disintegrate 
at  rest.  A  part  of  the  energy  lost  through  the  decay  will  be  recovered  in  the  electron- 
photon  component.  The  first  term  can  be  estimated  from  the  total  energy  con- 
tained in  the  //  spectrum  at  sea  level  and  this  has  already  been  given  as 

38  MeV  cm-2  sec-i  sterad-i. 

To  calculate  the  total  energy  dissipated  in  ionization  throughout  the  atmosphere 
we  must  know  the  differential  spectra  at  various  altitudes.  However  it  is  found 
that  p  which  refers  to  a  mean  ionization  loss  over  all  the  energies  of  the  spectrum 
is  a  slowly  varjdng  function  of  the  altitude  which  we  can  estimate  from  the 
spectrum  of  //-mesons  at  sea  level.  Thus  we  find  that  the  second  term  is 
59  MeV  cm~2  sec~i  sterad"^.  In  calculating  this  we  have  taken  the  "hard"  com- 
ponent, i.e.  radiation  filtered  through  a  depth  of  atmosphere  equivalent  to  a  block 
'of  lead  of  thickness  10  to  20  cm  to  consist  entirely  of  //-mesons.  Though  this 
assumption  is  fairly  justified  in  the  lower  atmosphere,  except  for  the  loss  of  very 
slow  //-mesons,  at  higher  atmosphere  the  subtraction  of  the  nucleonic 
component  from  the  penetrating  showers  to  estimate  the  //-mesons  is  a  procedure 
with  considerable  uncertainty.  In  a  similar  way  the  third  term  yields  187  MeV 
cm~2  sec~i  sterad"^.  The  fourth  term  is  mainly  due  to  very  slow  mesons  and 
is  found  to  be  small  of  the  order  of  5  MeV  cm"^  sec"^  sterad-^.  Thus  the  total 
energy  of  the  //-mesons  is  298  MeV  cm-^  sec^  sterad"^.  To  get  the  total  energy 
contained  in  the  pion  spectrum,  we  multiply  this  quantity  by  a  factor  1.27 
and  thus  obtain  367  cm"^  sec~^  sterad"^  as  given  by  the  generation  function. 

The  calculations  have  till  now  been  made  on  the  assumption  that  all  the 

particles  travel  in  the  vertical  direction,  which  is  not  quite  true.  In  the  first  instance 

the  //-mesons  undergo  multiple  Coulomb -scattering,  but  due  to  the  exponential 

production  in  atmosphere,  the  angular  spread  is  essentially  a  function  of //-meson 

energy  at  the  point  of  production.  This  means  that  the  effect  of  multiple  scattering 

can  be  taken  into  account  by  suitably  altering  the  vertical  intensity.  The  relation 

between  P,  the  intensity  in  the  absence  of  scattering,  and  /,  that  effectively 

measured,  is  n  ^ 

7^  =  /;(l_m<02»  (8) 

where  <02>  is  of  the  order  10"^.  This  correction  is  very  small  and  does  not  have 
any  appreciable  effect  on  the  balance.  Another  small  correction  arises  from  the 
fact  that  the  //-mesons  emitted  from  7i  decay  may  be  inclined  at  an  angle  to  the 
vertical.  For  muons  of  momentum  300  MeV/c  emitted  from  pions  with  energy 
between  300  and  330  MeV,  the  maximum  angle  of  emission  is  7.4°  which  is  of 
the  order  of  the  angle  due  to  multiple  scattering. 
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Besides  these,  we  must  now  discuss  the  correction  due  to  the  angular  distribution 
of  pions  when  they  are  produced.  There  are  two  possible  limiting  cases,  viz.  a 
secondary  collimated  in  the  direction  of  the  primary  and  a  secondary  which  is 
part  of  an  isotropic  distribution.  The  latter  secondaries  will  contribute  to  /i-mesons 
when  they  are  produced  by  inclined  primaries.  However  due  to  the  lack  of  in- 
formation on  the  production  length  of  pions  in  high  atmosphere,  we  cannot 
estimate  this  important  correction  in  any  definite  manner.  Puppi^  concludes 
that  the  global  energy  contained  in  the  spectrum  after  applying  the  necessary 
corrections  is  409  MeV  cm"^  sec"*  sterad"*.  He  also  discusses  the  presence  of 
positive  excess  in  the  vertical  muons  but  concludes  that  it  only  reflects  the  parti- 
cular composition  of  primary  radiation. 

The  Electron— photon  component 

The  origin  of  the  electron-photon  component  is  rather  well  understood  and  it 
lies  mainly  in  the  bremsstrahlung  of  fast  charged  particles  like  electrons,  protons 
and  pions,  the  decays  of  muons  and  the  rapid  decay  of  neutral  pions  into  two 
photons.  The  photons  or  electrons  generated  from  these  undergo  cascade  multi- 
plication because  of  successive  bremsstrahlung  and  pair  production  processes. 
The  multiplication  reaches  a  limit  near  84  MeV  in  air  and  after  this  the  particles 
lose  their  energy  mainly  through  ionization.  Since  it  is  difficult  to  decide  whether 
the  photons  or  electrons  are  produced  directly  by  the  nucleonic  components 
or  as  secondaries  of  secondary  particles  like  the  pions,  we  will  evaluate  the 
energy  contained  in  the  electrons  and  photons  by  noting  that  the  whole  electron- 
photon  shower  energy  must  reappear  in  the  form  of  ionization  energy.  If  we  know 
the  total  electronic  track  length,  we  can  evaluate 

a.     to 

f  f  ^{E)f{E;  t)dEdt  (9) 

nieC*    0 

which  will  give  the  total  energy  lost  in  ionization  where  /{E;  t)  refers  to  the  dif- 
ferential energy  spectrum  of  the  electrons  at  depth  t.  It  has  been  shown  by  Rossi  ^ 
how  the  total  track  of  electrons  of  energies  greater  than  a  certain  limit  (s^  10  MeV 
in  the  present  caSe)  can  be  related  to  the  global  energy  of  the  primaries  which 
have  generated  the  cascade,  i.e. 

E,=  S.27tT,.  (10) 

It  must  be  remembered  however  that  part  of  the  electrons  come  from  electro- 
magnetic interactions  and  decays  of  muons  and  hence  the  energies  of  these  elec- 
trons have  already  been  accounted  for  in  the  mesonic  component.  Also  it  is  found 
that  7  per  cent  of  the  hard  component  produces  knock-on  electrons,  i.e.  it  contri- 
butes   6  MeV  cm-2  sec~*  sterad"*  to  the  total  energy.  It  has  been  found  by 


1  G.  Puppi,  ibid. 

2  B.  Rossi,  Rev.  Mod.  Phys.  20,  537  (1948).  See  also  High  Energy  Particles,  New  York, 
Prentice  Hall  (1952). 
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Puppi^  that  the  contribution  through  decay  is  62  MeV  cm"^  sec"^  sterad"^, 
using  Michel's  form  for  the  spectrum  of  decay  electrons.  Since  the  total  electronic 
track  gives  324  MeV  cm-^  sec~^  sterad^^,  we  can  conclude  that  the  contribution 
of  neutral  pions  to  the  electron-photon  component  is  256  cm"^  sec~^  sterad"^. 
In  the  above  we  have  arrived  at  the  value  for  total  electron-photon  energy  by 
means  of  a  difference  between  the  height  intensity  curves  of  the  total  ionizing 
component  and  the  mesonic  coiJaponent.  We  have  ignored  the  details  regarding 
the  shape  of  the  spectrum  of  the  electron-photon  component  and  its  variation 
with  depth.  The  information  that  we  have  regarding  the  shape  of  the  spectrum 
is  not  adequate  but  recent  experiments  of  Lovati^  show  that  the  differential 
spectrum  whether  of  photons  or  electrons  with  an  energy  greater  than  300  MeV 
at  3500  m  obey  a  power  law  with  an  exponent  25.  From  the  present  available 
information  it  can  be  inferred  that  the  integral  spectra  of  photons  and  electrons 
in  a  vertical  direction  can  be  represented  by  a  function  of  the  form 

with  1.7  ^  y  ^  1.8  and  50  ^  j5  ^  150  MeV.  It  is  also  found  that  the  photon- 
electron  ratio  is  never  less  than  unity,  the  ratio  at  sea-level  being  unity.  This 
leads  to  a  ratio  two  at  an  altitude  3500  m.  This  constitutes  a  proof  that  most 
of  the  photons  originate  in  the  decay  of  neutral  pions.  The  knowledge  of  spectra 
a.t  various  levels  coupled  with  electron-photon  ratio  at  various  altitudes  will 
permit  a  direct  calculation  of  the  global  energy  dissipated  in  the  electron-photon 
component,  instead  of  the  present  estimates  based  on  the  subtraction  method. 
As  for  the  //-mesonic  component,  we  have  to  apply  corrections  due  to  the  angular 
spread  of  electrons  due  to  multiple  Coulomb-scattering  and  the  angular  cor- 
relations between  the  primaries  and  neutral  pions  when  they  are  produced.  Due 
to  uncertainties,  we  prefer  not  to  evaluate  such  corrections  and  thus  the  total 
neutral  pion  energy  is         ^56  MeV  cm-^  sec-i  sterad-i. 

Finally  we  note  that  the  secondary  processes  like  photo-meson  production  and 
production  of  neutral  mesons  through  interactions  of  charged  mesons  give  very 
small  corrections  which  in  view  of  the  large  uncertainty  contained  in  our  estimates 
can  be  safely  neglected. 

The  nucleonic  component 

We  have  to  discuss  the  nature  of  energy  dissipation  by  the  nucleonic 
component  in  processes  different  from  generation  of  charged  and  neutral  pions. 
The  total  ionization  losses  of  the  charged  part  of  nucleonic  component  can  be 
roughly  estimated  to  be 

70  MeV  cm-2  sec"^  sterad'^. 

1  G.  Puppi,  ibid. 

2  LovATi,  see  G.  Puppi's  article,  ibid. 
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The  nucleonic  component  loses  energy  by  nuclear  excitation  resulting  in  the 
formation  of  stars.  The  frequency  N  of  nuclear  disintegration  as  function  of  the 
number  n  of  the  ionizing  prongs  obey  the  following  relations 

N(>n)==Noe-^*";         N{>n)  =  Noe-^^-^^^''-  (12) 

at  balloon  and  movmtain  altitudes  respectively.  At  mountain  altitudes  the  average 
number  of  prongs  per  star  is  3.  Puppi  estimates  from  the  various  emulsion  data 
that  the  global  number  of  disintegration  in  an  atmospheric  column  of  unit  section 
is  about  3  cm"2  sec  ^  This  number  should  be  increased  to  allow  for  the  undetected 
disintegrations.  It  is  known  that  for  every  nuclear  disintegration  with  N  ionizing 
particles,  there  are  about  1.5  N  neutrons  which  implies  that  the  total  production 
of  neutrons  is  4.5  cm^  sec-^  in  unit  atmospheric  column.  Still  we  have  to  correct 
the  above  figures  by  the  disintegrations  in  which  only  neutrons  are  emitted  and 
by  events  which  are  excluded  by  the  selection  device  for  ionizing  particles.  The 
time  variation  experiments  of  Simpson  ^  have  revealed  considerable  information 
on  the  behaviour  of  slow  neutrons.  These  slow  neutrons  originate  mainly  in  low 
energy  stars  and  they  slow  down  to  thermal  energy  due  to  collisions  with  air 
nuclei  when  they  are  captured  by  nitrogen  in  the  reaction  N^*  {n,  p)  -^  C^"^. 
Though  the  number  of  thermal  neutrons  captured  per  gram  of  air  at  various  levels 
can  be  estimated,  for  the  energy  balance  we  are  interested  in  the  total  number 
of  neutrons  produced  in  nuclear  disintegrations.  A  fraction  of  neutrons  escape 
from  the  atmosphere  and  a  very  large  portion  does  not  reach  thermal  energies. 
Bethe's^  analysis  indicates  that  to  justify  the  number  of  thermal  captures  in  the 
atmosphere  which  is  3.5  cm"^  sec~^  the  number  of  neutrons  produced  in  nuclear 
disintegrations  should  be  much  greater,  e.g.  the  number  of  neutrons  generated  in 
nuclear  disintegrations  with  the  kinetic  energy  of  4  MeV  is  about 

(22  ±  4)  cm-2  sec-i. 

This  remark  helps  us  to  interpret  the  variation  with  height  in  atmosphere  of 
neutron  capture  frequency.  We  find  that  the  estimates  based  on  the  thermal 
captures  do  not  agree  with  those  obtained  from  the  frequency  of  disintegration 
with  ionizing  particle  and  in  fact  the  former  is  greater  by  a  factor  of  five.  We  have 
to  conclude  that  the  major  source  of  slow  neutrons  are  events  in  which  only 
neutrons  are  emitted.  Puppi ^  estimates  that  the  total  energy  contribution  from 
all  types  of  nuclear  disintegration  to  the  vertical  component  is  230  MeV  cm"^ 
sec~^  sterad"^.  This  value  is  very  uncertain  as  we  have  not  considered  the  even- 
tual low  energy  particles  such  as  protons,  alpha  particles  and  deuterons. 

The  above  analysis  of  the  vertical  beam  of  cosmic  rays  at  50°  N  gives  a  total 
energy  dissipated  to  be  965  MeV  cm-^  sec~^  sterad"^,  through  the  three  com- 
ponents, i.e.  mesons,  electron-photon  and  the  nucleonic  component.  The  total 
incoming  primary  energy  is  estimated  to  be  around  1400  MeV  cm"^  sec~^  sterad"^. 


1  J.  A.  Simpson,  Phys.  Rev.  76,  1750  (1949). 

2  H.  A.  Bethe,  C.  Forff  and  G.  Placzek,  Phys.  Rev.  57,  573  (1940). 

3  G.  Puppi,  ibid. 
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In  view  of  the  large  uncertainties  in  the  estimates  on  the  low  energy  nucleonic 
component  and  nucleonic  cascades  of  other  particles  the  disagreement  between 
the  dissipated  and  primary  energy  need  not  disturb  us.  Thus  we  can  conclude 
there  is  not  sufficient  energy  balance  after  allowing  for  dissipation  through  known 
processes  to  warrant  any  assumption  of  hitherto  unknown  processes. 


Fig.  68.  Energy  spectrum  of  primary  cosmic  ray  particles. 


CHAPTER  XIV 


ORIGIN  OF  COSMIC  RADIATION 


By  far  the  most  interesting  question  about  cosmic  radiation  relates  to  its  origin. 
Till  recently  the  theories  of  the  origin  of  cosmic  radiation  were  in  a  speculative 
stage  since  very  few  criteria  had  to  be  satisfied  with  the  hitherto  available  observa- 
tional data  on  the  primary  components  of  cosmic  radiation.  However  since  1948 
our  knowledge  of  the  composition  of  the  primary  components  in  the  neighbour- 
hood of  the  earth  has  become  more  precise  'and  conclusions  about  the  distri- 
bution of  cosmic  radiation  in  the  galaxy  and  beyond  have  become  possible  from 
the  data  on  radio-astronomy. 

It  is  convenient  to  deal  with  the  life  history  of  the  cosmic  radiation,  before  it 
reaches  the  earth,  in  two  stages. 

The  first  relates  to  the  primitive  source  of  cosmic  radiation,  that  is,  where 
the  particles  are  first  produced. 

The  second  relating  to  the  intermediate  mechanism  through  which  these 
particles  attain  the  energy,  composition  and  spatial  distribution  characteristic 
of  the  radiation  that  reaches  the  earth. 

An  intermediate  mechanism  implies  the  following : 

(a)  There  must  be  a  method  by  which  the  injected  particles  are  accelerated 
to  the  very  high  energies  found  in  cosmic  rays, 

(b)  there  must  be  a  region  of  space  in  which  the  cosmic  rays  are  trapped  or 
stored  for  the  acceleration  mechanism  to  be  effective, 

(c)  there  must  be  a  mechanism  of  escape  which  could  explain  the  energy 
spectrum. 

These  three  factors  will  naturally  depend  on  the  sources  of  injection.  It  is 
obvious  that  if  the  primitive  source  were  extragalactic,  the  storage  domain  should 
be  the  intergalactic  space ;  if  the  source  were  galactic,  the  domain  must  be  inter- 
galactic,  and  if  the  source  were  solar,  the  domain  must  be  the  region  in  the  neigh- 
bourhood of  the  earth  and  the  sim.  It  would  be  worthwhile  to  compare  the  density 
of  the  cosmic  rays  with  the  various  forms  of  energy  available  in  the  universe 
before  making  any  postulate  about  the  source  of  the  cosmic  rays. 


^  See  "The  Origin  of  Cosmic  Radiation".  By  V.  I.  Ginzburg,  Progress  in  Cosmic  May 
Physics,  4,  339  (1958). 

We  are  deeply  indebted  to  Professor  S.  Chandrasekhar  for  giving  us  his  private  notes  on 
the  theories  of  origin  of  Cosmic  Rays.  Also  we  wish  to  thank  Professor  Phillip  Morrison  whose 
lecture  at  Madras  clarified  many  difficulties  on  this  subject. 
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Source 

Energy  density  C/  in  Me  V 

Cosmic  rays 
Solar  light 

Turbulence  in  galactic  gas 
Galactic  rotation  (systematic) 
Mass  energy  (rest  mass  in  space 
and  stars-averaged  over) 

1 

1 

10  -  100 

10+3 

10+ » 

Magnetic  energy 

1 

The  original  surmise  that  the  turbulence  energy  of  the  universe  can  be  considered 
as  a  primitive  soiu'ce  was  given  up  since  it  was  realised  that  the  production  of 
the  cosmic  rays  would  affect  the  hydrodynamics  of  the  gas.  The  rotational  energy 
is  also  ruled  out  since  there  is  no  known  mechanism  of  converting  this  energy 
into  the  kinetic  energy  of  single  protons.  The  density  of  cosmic  rays  being  unity 
the  energy  of  the  source  must  be  of  much  higher  order  and  hence  it  is  natural 
to  trace  it  to  the  mass  energy. 

An  extreme  postulate  was  made  by  certain  workers  on  cosmic  radiation  that 
no  intermediate  mechanism  is  operative  but  the  cosmic  rays  are  produced  with 
the  known  characteristics  directly  in  a  catastrophic  production. 

In  the  theory  of  catastrophic  production,  it  is  assumed  that  high  energy  particles 
are  directly  produced  in  some  forms  of  stellar  explosions.  Originally  the  following 
were  considered  the  obvious  objections  to  a  catastrophic  theory. 

(i)  Since  the  energy  density  of  cosmic  radiation  is  comparable  to  that  of  star- 
light, if  we  believe  that  the  radiation  fills  the  universe,  the  total  energy  is  com- 
parable to  the  rest  energy  of  universal  matter  and  it  seems  untenable  that  matter 
is  being  annihilated  at  such  a  rate. 

(ii)  The  radiation  is  isotropic  and  it  seems  difficult  to  suggest  a  mechanism 
by  which  the  particles  ejected  in  stellar  explosions  can  be  rendered  isotropic. 

(iii)  Any  suggested  catastrophic  process  by  its  very  nature  must  be  ad  hoc 
and  its  existence  cannot  be  proved.  The  most  powerful  argument  against  cata- 
strophic production  is  the  existence  of  heavy  primaries  which  have  the  same 
energy  per  nucleon  as  the  protons.  It  is  extremely  hard  to  believe  that  any 
catastrophic  process  which  gives  a  velocity  of  many  billion  electron  volts  per 
nucleon  should  not  provide  enough  energy  to  disrupt  the  nucleus.  But,  we  shall 
see  that  all  these  objections  can  be  met  effectively  by  introducing  an  inter- 
mediate mechanism  involving  gradual  acceleration.  Among  other  reasons  for 
postulating  the  existence  of  an  intermediate  mechanism  the  following  one  stands 
out  as  the  most  important. 

Relative  Abundances  of  Elements 


Elements 

Cosmic  Rays 

Sun 

H 

He 

Z>2 

Electrons 

1 

0.1 

0.03 

<10-2 

1 

0.01 

^0.001 

1.3 
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An  examination  of  the  table  of  relative  abundance  of  the  various  elements 
reveals  that  there  are  practically  no  electrons  in  the  cosmic  rays.  Since  electrons 
should  also  be  produced  in  the  processes  which  give  rise  to  the  protons  the  absence 
of  electrons  in  cosmic  rays  is  explained  as  follows : 

The  relativistic  electrons  which  we  can  expect  to  be  produced  along  with  other 
particles  that  are  actually  observed  in  cosmic  rays  lose  their  energy  by  magnetic 
bremsstrahlung  while  getting  accelerated  by  the  interstellar  magnetic  fields. 
The  evidence  for  this  is  found  in  the  cosmic  radio  noise. 

The  shape  of  the  distribution  of  the  cosmic  ray  particles  with  energy  is  found 
to  be  as  follows.  There  is  a  definite  cut-off  at  the  low  energy  and  at  about  10^  MeV 
and  the  spectrum  is  practically  linear  over  a  wide  energy  range.  There  is  no  evi- 
dence for  a  cut-off  at  the  high  energy  end  of  the  spectrum  which  is  not  well  defined. 
But  it  is  natural  to  expect  such  a  cut-off  since  the  relativistic  protons  which 
comprise  the  major  portion  of  the  cosmic  radiation  can  also  lose  energy 
by  magnetic  bremsstrahlung  and  the  cut-off  is  expected  to  be  at  about  10^°  MeV. 
In  view  of  the  wide  range  of  the  energy  spectrum  of  the  cosmic  radiation  ranging 
from  10®  to  10^®  eV  it  is  obvious  that  we  should  not  seek  a  single  region  of  the 
universe  as  a  possible  primitive  source.  There  is  reason  to  expect  that  the  low 
energy  region  of  the  spectrum  is  of  solar  origin,  the  straight  line  portion  from 
10®  to  10^®  eV  of  galactic  origin,  sources  of  extragalactic  nature  giving  rise  to 
particles  of  energy  even  greater  than  10^^  eV.  Before  discussing  the  possible 
sources,  we  shall  give  an  accoimt  of  the  methods  of  acceleration. 


1.  THEORIES  OF  INTERMEDIATE  MECHANISM 

Gradual  acceleration  of  charged  particles  can  only  be  realized  either  by  electro- 
static or  non-static  magnetic  fields.  The  possibility  of  acceleration  by  an  electric 
field  is  immediately  dispensed  with  by  the  foUowiag  considerations : 

The  potential  difference  for  producing  such  high  velocities  has  to  be  very 
high,  but  in  regions  of  high  conductivity  which  do  occur  in  galactic  and  inter- 
galactic  space,  it  would  not  be  easy  to  maintain  such  large  fields.  A  quantitative 
idea  of  the  dissipation  of  electrostatic  fields  is  obtained  directly  by  considering 
Maxwell's  equations.  If  o  is  the  conductivity  of  the  gas,  the  current  j  is  given  by 
<yE  and  therefore  we  obtain 

c  V  dt  "J 

1  \  dE  1 

(1) 


-4( 


^^     .    .1 

Taking  the  divergence  on  both  sides, 

r  7)  ^ 

(2) 


P.(FxH)  =  0  =  - 
c 
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neglecting  the  derivative  of  c  and  noting  that  V  •  E  =  4:71q  we  obtain 

-j-=  -47iag     or     Q^e-^""^^,  (3) 

i.e.   the   charge  distribution  decays  with  a  mean  lifetime .  In  an  ionized 

4:7ia 

gas,  the  conductivity  has  the  value  a  =  NXe^jmv  where  iV  is  the  electron  density, 

X  the  mean  free  path,  m  the  mass,  v  the  average  velocity  and  e  the  electronic 

charge.  Assuming  (i)  that  the  interstellar  space  contains  one  hydrogen  atom 

per  cm^,  (ii)  that  this  hydrogen  is  completely  ionized  (although  actually  only 

a  fraction  is  ionized),  and  (iii)  that  the  kinetic  temperature  of  the  electron  is 

a  few  degrees  absolute,  we  find  V/^10®cmsec~^  and  A/^10^  cm  and  therefore  we 

get  a  =  10'^  sec~i.  Since  the  velocity  is  proportional  to  T^  and  the  mean  free 

path  to  T^ ,  the  conductivity  increases  as  T^ .  At  higher  temperatures  we  shall 

therefore  expect  higher  conductivities,  say  values  exceeding  g  =  10^^  sec~^. 

In  a  completely  ionized  gas,  the  product  iVA  is  independent  of  the  density 
and  thus  the  conductivity  does  not  depend  on  the  assumed  number  of  hydrogen 
atoms  per  cubic  centimetre.  Even  for  incomplete  ionization  the  conductivity  will 
remain  practically  unchanged  as  long  as  the  total  scattering  cross-section  of  the  ions 
is  greater  than  that  of  the  neutral  atom.  Actually  an  ionization  as  low  as  one  in  10^^ 
will  give  the  same  product  iVA  as  we  obtained  for  complete  ionization  and  a  low 
temperature  of  a  few  degrees  absolute.  The  low  number  10 ~^^  of  electrons  per 
proton  could  easily  arise  from  even  such  rare  elements  as  Cs  or  Rb.  These  can  be 
ionized  by  radiation  of  3000  A  and  such  radiation  is  surely  present  throughout 
the  galaxy.  Due  to  ionization  of  some  of  the  more  abundant  elements  like  Na, 
K  and  Ca  which  are  rather  more  difficult  to  ionize,  it  is  believed  that  the  number 
of  electrons  per  cm^  does  not  become  lower  than  10"^  times  the  number  of  protons 
per  cm^  anywhere  within  the  galaxy.  Thus  we  may  consider  that  the  lower  bound 
of  the  conductivity  a  in  the  interstellar  spaces  in  the  galaxy  is  of  the  order  of 
10^  sec"^.  This  of  course  does  not  mean  that  electric  fields  cannot  persist  for  times 
longer  than  10  '^  sec,  but  it  does  mean  that  extended  electrostatic  fields  must  not 
be  expected  anywhere  within  the  galaxy. 

Therefore  two  modes  of  acceleration  involving  magnetic  fields  have  been 
proposed : 

(i)  The  betatron  mechanism  of  Swann^,  (ii)  Fermi's  ^  mechanism  of  energetic 
collisions  of  the  charged  particles  with  "wandering"  magnetized  interstellar 
matter. 

Swann's  mechanism 

The  main  merit  of  this  mechanism  lies  in  its  simplicitj^  The  method  of 
acceleration  can  be  described  in  the  following  terms. 


1  W.  F.  G.  SwANN,  Phys.  Rev.    43,  217  (1933). 

2  E.  Fermi,  Ap.  J.  119,  1  (1954). 
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Let  US  assume  a  magnetic  field  in  the  z  direction  and  let  it  increase  with  time 
throughout  space.  liN^  be  the  magnetic  flux  in  a  cylinder  of  radius  r,  the  electric 
field  Eq  produced  tangential  to  the  surface  of  the  cylinder  is  given  by 

If  V  is  the  velocity  of  the  particle  and  T  its  kinetic  energy 

dT       „  ve        dNr  mv^       Hev 

If  we  assume  the  particle  to  have  relativistic  velocities,  the  kinetic  energy  is 
given  by  the  well-known  relation 

mH^c^  =  T2  +  2mQC^T.  (6) 

From  these  four  equations  we  obtain 

271      dt  dt  ^^ 

where  Hf,  is  the  magnetic  field  at  the  point  where  the  electron  is  foimd.  Since, 

r 

by  definition,  N  =  J  H  -  271q  de  we  obtain 
u 

A|^,d,  =  A[^^.,.j.  (8) 

This  implies  that  rH(r)  is  constant  for  all  values  of  t.  In  other  words,  it  implies 
the  particle  moves  in  a  spiral  such  that  the  percentage  time  rate  of  change  of  field 
at  all  points  in  its  trajectory  is  the  same.  From  the  above  theory  it  is  clear  that 
even  with  H  growing  up  only  to  2  x  10"^  gauss,  provided  r  is  very 
large  (3  x  10^^  cm),  we  obtain  kinetic  energies  of  the  order  of  1.7  x  10^^  eV. 
Two  main  criticisms  have  been  advanced  against  this  theory: 

(1)  Whether  such  changing  magnetic  fields  exist  in  the  universe, 

(2)  Criticism  by  Teller^  using  a  relativistic  argument  which  runs  as  follows: 
The  motion  of  magnetic  lines  of  force  through  stationary  matter  gives  rise  to 

an  electric  field  E  = where  v  is  the  velocity  of  motion  of  the  magnetic 

c 

lines  of  force.  The  conclusion  of  the  previous  paragraph  means  that  in  the  coordi- 
nate system  in  which  the  conducting  matter  is  at  rest,  v  is  zero  and  therefore 
no  electric  field  will  be  felt  in  the  system  in  which  the  conducting  gas  is  at  rest. 
In  the  presence  of  a  magnetic  field  charges  may  flow  freely  along  the  magnetic 
lines  of  force.  This  means  that  the  component  of  the  electric  field  which  is  parallel 
to  the  magnetic  field  will  disappear  in  materials  of  high  conductivity  within  a 
very  short  time.  If  the  electric  and  magnetic  fields  are  perpendicular,  two  cases 


1  E.  Teller,  i?ej9.  Progr.  Phys.  17,  154  (1954). 
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should  be  distinguished  according  as  whether  E  or  H  is  greater,  measured  in  the 
C.G.S.  system  of  units.  If  E  is  greater  a  coordinate  system  can  be  found  moving 

with  the  velocity     ,  „,     in  which  the  magnetic  field  vanishes  but  an  electric 

field  is  present.  In  this  coordinate  system,  the  electric  field  will  again  vanish 
within  a  very  short  time  and  we  can  therefore  exclude  the  possibility  of  |^  | 
being  greater  than  |^  |.  If  the  magnetic  field  is  larger,  then  a  coordinate  system 

can  be  found  moving  with  a  velocity  c  4-^  in  which  a  magnetic  field  is  present 

but  the  electric  field  vanishes.  In  this  coordinate  system,  positive  and  negative 
ions  will  move  in  circles  with  opposite  senses  of  rotation.  Thus  in  this  coordinate 
system,  the  ions  will  not  move  relative  to  the  lines  of  force  of  the  magnetic  field. 
Given  a  sufficient  length  of  time  in  which  ions  and  neutral  material  may  come 
to  equilibrium,  a  situation  will  be  established  in  which  no  electric  field  is  present 
and  no  "wind"  is  felt  in  our  new  coordinate  system.  In  another  coordinate  system 
both  the  electric  and  magnetic  fields  will  be  present,  while  positive  and  negative  ions 
will  move  in  cycloids  with  opposite  sensesofrotationbut  equal  velocities  of  trans- 
lation. The  above  discussion  leads  to  the  conclusion  that  changing  magnetic 
fields  will  produce  electric  fields,  but  these  electric  fields  will  impart  a  motion 

c    ^1  to  the  conducting  matter  as  a  whole  and  will  not  lead  to  acceleration  of 

individual  ions  which  could  be  identified  with  the  primary  particles  of  cosmic 
radiation. 

Without  referring  to  the  relevance  of  this  criticism  to  Swann's^  theory,  Teller's 
remarks  themselves  are  open  to  objection. 

Since  the  electric  field  changes  with  t  the  velocity  of  the  moving  frame  has  to 
be  continuously  changed,  that  is,  it  becomes  an  accelerated  frame  to  which  special 
relativistic  theories  do  not  apply. 

Magnetic  fields  unlike  electric  fields  are  conserved  for  long  periods  due  to  the 
effects  of  self  induction  in  a  conductor  of  large  dimensions  like  the  ionized 
material  in  the  interstellar  space.  This  is  easily  seen  if  we  take  the  curl  on  both 

1    dH 

sides  of  equation  (1)  and  the  time  derivative  of  the  equation  p  x  £=  — ^— . 

Neglecting  the  derivative  of  o  and  using  the  fact  that  J7  •  H  =  0  we  obtain 

p.H^^^^  +  ±"^^.  (9) 

c2   dt^  ^    c2     dt  ^^ 

We  can  understand  the  time  dependence  of  H  by  neglecting  one  of  the  three 
terms  of  the  equation  (9) 

(1)  If  P^H=0,     i/=Hoe-*^^^ 
This  is  the  magnetic  field  induced  by  a  decaying  electric  field  with  a  charge 
distribution  given  in  equation  (3). 


1  W.  F.  G.  SwANN,  Ibid. ;  E.  Fermi,  Ibid. 
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(2)  If  dH/dt^i),  the  equation  is  that  of  the  propagation  of  electromagnetic 
radiation. 

(3)  If  -y  2 ^  0  we  have  a  slowly  changing  or  quasistatic  magnetic  field  which 

is  of  great  interest  to  us  since  interstellar  magnetic  fields  are  probably  of  this 
type.  Then  the  equation  reduces  to  a  diffusion  equation  with  diffusion  coefficient 

■"^'  y^„^^/.^.  (10, 

C  dv 

If  /y  is  a  linear  dimension  within  which  considerable  spatial  variation   has 

4:7lO 

originally  been  present,  t^ — T"-^^  ^®  ^^^  ^^®  ^^  which  the  magnetic  field  is 

c 

dissipated.  Using  forcj  its  minimum  value  10'  sec"^  and  for  L  the  value  3  x  lO^^cm, 

we  find  /=  1.2  X  10®  sec  --  4  years.  If  Z  is  a  light  day,  we  obtain  /  =  3  x  10^** 

years  which  is  greater  than  the  age  of  the  universe.  Thus  for  distances,  quite 

modest  on  a  galactic  scale,  magnetic  fields  remain  tied  to  the  conducting  gas. 

It  follows  therefore  that  these  magnetic  fields  can  constitute  a  mechanism  for 
trapping  cosmic  ray  particles  within  the  galaxy  until  the  energy  density  is  built 
up  to  the  required  order  of  magnitude. 

The  observations  of  Hiltner  and  Hall^  of  an  appreciable  polarization  of  light 
coming  to  us  from  distant  stars  has  been  interpreted  as  due  to  the  orientation  of 
the  non-spherical  dust  grains  by  a  magnetic  field.  On  this  interpretation,  the 
polarization  gives  us  some  information  on  the  strength  and  direction  of  the 
magnetic  field.  Hiltner's  measurements  indicate  that  in  the  vicinity  of  the  earth, 
the  magnetic  field  is  approximately  parallel  to  the  direction  of  the  spiral  arm  of 
the  galaxy.  This  fact  suggests  that  we  may  perhaps  think  that  the  spiral  arms 
are  magnetic  lines  of  force.  The  direction  of  polarization  of  starlight  indicates 
further  that  in  our  vicinity  the  magnetic  lines  of  force  show  irregular  deviations 
from  parallelism  of  the  order  of  10°.  This  fact  excludes  the  hypothesis  that  the 
lines  of  force  yield  completely  to  the  turbulent  motions  of  interstellar  matter 
because  then  they  would  be  rapidly  bent  into  shapes  much  more  irregular  than 
observed.  We  are  forced  to  conclude  that  the  field  is  sufficiently  strong  to  yield 
only  a  little  to  the  transverse  component  of  the  turbulent  motion.  Indeed,  as  was 
pointed  out  by  Davis  ^,  the  small  deviations  from  parallelism  of  the  field  enable 
us  to  estimate  the  field  to  be  of  the  order  of  10 ~^  gauss.  Recently  Chandrasekhar 
and  Fermi^  have  re-examined  the  problem  considering  in  particular  the  balance 
between  magnetic  and  gravitational  effects  in  the  spiral  arm.  They  conclude 
that  the  field  intensity  is  6  x  10  ~*  gauss.  Owing  to  the  turbulence  the  lines  of 


1  HiLTNEE  and  Hall,  Science  109, 165  (1949) ;  J.  S.  Hall,  Science  109,  166  (1949) ;  J.  S.  Hall 
and  A.  H.  Miksell,  Ap.  J.  54,  187  (1949). 

2  L.  Davis,  Phys.  Rev.  100,  1440  (1955),  101,  351  (1956). 

3  S.  Chandrasekhar  (private  communication);  S.  Chandrasekhar  and  E.  Fermi,  Ap.  J. 
118,113(1953). 


ORIGIN    OF    COSMIC    RADIATION  485 

force  are  laterally  deformed  until  the  magnetic  field  increases  to  the  point  of 
forcing  a  reversal  of  the  motion  of  the  material  and  pushing  back  the  diffuse 
matter  and  imposes  on  it  some  kind  of  very  irregular  oscillatory  motions.  It  is 
therefore  expected  that  the  lines  of  force  will  sway  back  and  forth  and  also  that 
the  field  intensity  will  fluctuate  along  the  same  lines  of  force.  Thus  we  are  inclined 
to  believe  that  the  magnetic  field  constitutes  the  accelerating  mechanism. 

Fermi's  mechanism  of  acceleration  of  particles 

Fermi  ^  suggested  a  mechanism  of  acceleration  due  to  the  interaction  of  cosmic 
particles  with  wandering  magnetic  fields  in  interstellar  spaces.  Such  fields  have 
remarkable  stability,  as  we  found  earlier,  because  of  the  large  dimensions  and 
high  conductivity  of  the  interstellar  space.  Indeed  the  conductivity  is  so  high 
that  one  might  describe  the  magnetic  lines  of  force  as  attached  to  the  matter 
and  partaking  in  its  streaming  motions.  On  the  other  hand,  the  magnetic  field 
itself  reacts  on  the  hydrodynamics  of  interstellar  matter  which  according  to 
Alfveii  2  can  pictorially  be  described  by  saying  that  to  each  line  of  force  we  should 
attach  a  material  density  due  to  the  mass  of  matter  to  which  the  line  of  force 
is  linked.  On  this  point  of  view,  Alfven  obtained  a  simple  formula  for  the  velocity 
of  propagation  of  magneto-elastic  waves,  v  =  ^o(47i;^o)'^%  ^o  being  the  intensity 
of  the  magnetic  field  and  q^  the  density  of  interstellar  matter. 

Now  consider  a  fast  particle  moving  in  such  wandering  magnetic  fields.  If  the 
particle  is  a  proton  having  an  energy  of  a  few  BeV  it  will  spiral  round  the  lines 
of  force  with  the  radius  of  the  order  of  10^^^  cm  until  it  "collides"  against  an 
irregularity  in  the  cosmic  field  and  undergoes  some  kind  of  irregular  motion. 
On  a  collision  both  gain  or  loss  of  energy  may  take  place,  gain  being  more  probable 
as  is  understood  by  the  argument  that  ultimately  statistical  equilibrium  must  be 
established  between  the  degrees  of  freedom  of  the  wandering  field  and  the  de- 
grees of  freedom  of  the  particle.  Equipartition  may  lead  to  an  unbelievably  high 
energy  and  a  limitation  to  this  process  is  imposed  by  the  enormous  times  that 
are  required  to  attain  such  high  energies.  A  charged  particle  will  be  spiralling 
round  a  uni-directional  magnetic  field  with  the  axis  of  the  spiral  parallel  to  the 
field.  We  shall  give  here  a  dynamical  argument  to  show  that  there  is  a  gain  in 
the  energy  of  a  particle  in  a  collision  with  a  cloud. 

If  the  radius  of  the  spiral  is  of  the  order  of  10^^  cm  and  the  irregularities  of 
the  field  are  of  the  order  of  10^^  cm,  the  particle  will  perform  many  turns  on  its 
spiralling  path  before  encountering  an  appreciably  different  field  intensity.  As 
the  particle  approaches  the  region  where  the  field  intensity  increases,  the  pitch 
of  the  spiral  decreases  since  ^in^QjH  is  a  constant  where  0  is  the  angle  between  the 
direction  of  the  line  of  force  and  the  direction  of  the  velocity.  The  particle  is 
"reflected"  at  the  point  where  sin  0  =  1.  If  the  magnetic  field  were  static,  this 
reflection  will  not  produce  any  change  in  the  kinetic  energy  of  the  particle. 

1  E.  Fermi,  Ihid. 

^  H.  Alfven,  Cosmical  Electrodynamics,  Oxford  (1950). 
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However,  the  particle  will  gain  energy  if  a  region  of  high  field  intensity  moves 
towards  it  and  lose  energy  if  the  region  of  high  field  intensity  moves  away  from  it. 
The  net  result  will  be  an  average  gain,  primarily  for  the  reason  that  head-on 
collisions  are  more  frequent  than  overtaking  collisions  because  the  relative  velo- 
city is  larger  in  the  former  case.  The  amount  of  energy  gained  or  lost  in  a  collision 
of  the  two  types  described  can  be  estimated  by  a  simple  argument  using  the 
concepts  of  special  relativity  without  any  reference  to  the  detailed  mechanism 
of  collision.  In  the  frame  of  reference  8*  in  which  the  irregularity  of  the  field 
against  which  the  collision  takes  place  is  at  rest,  there  is  no  change  in  the  energy 
of  the  particle.  The  change  of  energy  in  the  original  frame  S  is  obtained  by  th(> 
following  method:  We  first  transform  the  initial  energy  and  momentum  from 
S  to  S\  In  S*  an  elastic  collision  takes  place  whereby  the  momentum  changes 
direction  and  the  energy  remains  unchanged.  Transforming  back  to  S,  one  obtains 
the  final  values  of  energy  and  momentum. 

Let  the  ''irregularity  "  move  with  a  velocity  v  =  Bc  and  the  particle  with  a  velocity 
u  =  pc  and  a  momentum  P^  in  8.  In  /S'  the  momentum  P^  and  energy  E'  is  given  by 

E  +  BeP, 


gi^enby  jcP^  +  BE 

C  £   . 


After  collision  in  8'  the  momentum  and  energy  of  the  particle  are  P^'  and  E'* 

Transforming  back  to  the  original  frame  8  we  obtain  P'J'  and  E'"  of  the  particle 
after  collision 

cp;"=^'-l4fl  (15) 


E'"  = 


1/1-^ 
E"  -BcP'^ 

E  +  BcP^       ^cP^  +  BE 


1/1-^2  1/1-^2      I     |/1_52 

^{E-]-2BcP^  +  B^E)l{l-B^)  (16) 

For  the  particle  with  velocity  u  and  energy  E  we  have 

_  mcux  „  mc^  cPx       u^       o       f^        ,^c,s 

yi  —  u^jc^  \\  —  u^jc^  E         c 


E'"  _  l  +  2BPco8d  +  B^ 


(19) 
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For  an  overtaking  collision  we  get  a  similar  formula  except  that  the  sign  of  B 
must  be  changed 

--- j  =  2  52  ^  2  5  ^  cos  0  -  2  52  ^2  cos2  d  (head  on  collisions) 

=  2  52  _  2  5  ^  cos  0  -  2  52  ^2  cos2  e  (overtaking  colhsions) .    (20) 

^cosd  +  B 

Probability  for  the  head  on  collision  is  — x-5 ^ —  and  the  probability  for  the 

„.  .       ^Gosd-B  ^i^cos^^ 

overtaking  collision 


E" 
log 


E 


2/5cos(9 
^         [{2 B^  +  2 B P cosO  -  2 B^ ^^ cos^O)  iP cosO  +  B) 


2/5cos6 

+  (252-2i^i5cos0-2J52^2cos2(9)(^cos0-5)] 

=  452-252^2cos20, 


i.e.  log 


E+dE 


(^^^)  =  cB^     ,0-1).  (21) 


The  average  gain  in  energy  per  collision  is  of  the  order  of  magnitude  d(E)  =  B^E. 
If  E  is  the  energy  of  the  particle  inclusive  of  the  rest  energy  the  average  gain 
in  energy  per  collision  is 

dE  -  B^E  where  B  =  —^  lO^.  (22) 

A  particle  starting  with  non-relativistic  energy  will  therefore  after  N  collisions 
attain  the  energy 

E  ==  Mc^ e^'  ^,        N  being  very  large.  (22  a) 

To  explain  the  energy  spectrum,  Fermi  introduced  an  absorption  process  due  to 
collisions  with  matter  of  an  average  density  of  10"24  g/cm^  leading  to  an  ab- 
sorption mean  free  path  A  =  7  x  102^  cm.  The  average  lifetime  of  a  cosmic  ray 
particle  is,  let  us  say,  T  which  is  approximately  2  x  10^^  sec  /^  10  million  years. 
The  cosmic  ray  particles  now  present  will  therefore,  on  an  average,  have  this 
age.  Some  of  them  will  accidentally  escape  destruction  and  will  be  considerably 
older.  Indeed  the  absorption  process  can  be  considered  to  proceed  according  to 
an  exponential  law.  If  we  assume  that  the  original  particles  at  all  times  have 

been  supplied  at  the  same  rate,  we  expect  the  age  distribution  to  be  e'*^'^-™-. 

During  this  age  t  the  particle  has  been  gaining  energy.  If  we  call  t  the  time  between 
scattering  collisions,  the  energy  acquired  by  a  particle  of  age  t  will  be 

E{t)  =  Mc^eB'il^ .  (23) 

Hence  the  probability  that  the  particle  has  energy  between  E  and  ^  +  d^  is 
given  by 

^[E)  dE  =  (J^)  (M c^YlB'T  -^^^-^  AE.  (24) 
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It  is  gratifying  to  find  that  the  theory  leads  naturally  to  the  conclusion  that 
the  spectrum  of  the  cosmic  radiation  obeys  an  inverse  power  law.  By  a  comparison 
of  the  exponent  of  this  law  with  the  one  known  from  cosmic  rays  observations, 
i.e.  about  2.5,  we  find  a  relationship  which  permits  us  to  determine  the  value 
of  T,  the  time  interval  between  collisions.  We  have 

2.9=  1  +  {rlB^T)  (25) 

from  which  it  follows  that  t  =^  1.9  5^  T  =  4  x  10"' sec /^  1.3  years.  Since  the 
particles  travel  with  approximately  the  velocity  of  light,  this  corresponds  to  a 
mean  distance  between  collisions  of  the  order  of  a  light  year  or  about  10^®  cm. 
Such  a  collision  mean  free-path  seems  to  be  quite  reasonable. 

The  theory  explains  quite  naturally  why  no  electrons  are  found  in  the  primary 
cosmic  radiation.  This  is  due  to  the  fact  that  at  all  energies  the  rate  of  loss  of  energy 
by  the  electron  exceeds  the  gain.  At  low  energies  up  to  about  300  MeV  the  loss 
is  mainly  due  to  ionization.  Above  this  energy,  radiative  losses  due  to  acceleration 
of  the  electrons  in  the  interstellar  magnetic  fields  play  a  dominant  role.  This  last 
energj^  loss  is  quite  negligible  for  protons.  Also  the  inverse  Compton  effect  discussed 
by  Feenberg  and  Primakoff  ^  will  contribute  to  eliminate  high  energy  electrons. 

Since  Fermi  ^  first  propounded  this  theory  there  have  been  changes  in  the  views 
about  the  structure  of  the  galactic  field  which  affect  the  validity  of  this  theory. 
They  are:  (1)  the  galactic  field  is  now  considered  much  more  regular  than  he 
assumed,  (2)  the  protonic  and  heavier  components  have  very  much  the  same  energy 
spectrum.  This  is  incompatible  with  the  absorption  mechanism  of  Fermi,  since 
heavy  nuclei  have  larger  cross-sections  for  collision  than  protons.  The  difficulty 
would  be  removed  if  we  were  to  introduce  an  absorption  process  equally  effective 
for  both  types,  i.e.  for  example,  collision  against  stars.  But  simple  estimates  show 
that  probability  of  such  a  collision  is  vanishingly  small.  Another  means  of  re- 
moving cosmic  ray  particles  is  by  diffusion  outside  the  galaxy.  For  such  a  theory 
it  is  necessary  to  assume  that  the  escape  time  for  diffusion  outside  the  galaxy  is 
appreciably  shorter  than  the  nuclear  collision  time.  Since  this  method  of  removal 
is  very  efficient  it  was  found  necessary  by  Fermi  ^  to  improve  the  acceleration 
mechanism  in  a  manner  so  as  to  make  possible  the  attainment  of  high  energies 
before  escape.  This  he  was  able  to  do  by  postulating  a  "jaw  mechanism"  by  which 
a  cosmic  ray  particle  is  "caught"  between  two  magnetic  discontinuities  such  that 
it  is  repeatedly  reflected  and  gains  energy  till  it  escapes  from  the  jaw. 

Let  H  be  the  average  value  of  the  magnetic  field  and  -^max  ^^^  maximum  field 
along  the  line  of  force  that  may  be  likely  to  cause  a  reflection.  If  6  is  the  angle 
of  the  pitch  of  the  spiral  where  the  field  average  is  H,  reflections  will  occur  for 

particles  having  0  >;f  where  sin;^  =  L/-^ .  A  simple  calculation  shows  that  when 

r    -"max 

a  particle  with  0  >  ;f  is  caught  in  a  trap  both  its  energy  and  angle  of  pitch  will 


1  E.  Feenberg  and  H.  Primakoff,  Phys.  Rev.  73,  449  (1948). 
^  E.  Fermi,  loc.  cit. 
3  E.  Ferivh,  loc.  cit. 


ORIGIN    OF    COSMIC    RADIATION 


489 


change  with  time,  but  the  product  E  sin  d  remains  approximately  constant.  The 
particle  can  escape  the  trap  only  when  d  has  decreased  to  such  a  value  that  the 
condition  0  >  ;f  is  no  longer  fulfilled.  In  this  process,  the  energy  must  increase 
by  a  factor  sin0/sin;f.  This  process  leads  to  a  sizeable  energy  gain  in  a  relatively 
short  time.  For  example  if  the  jaws  of  the  trap  are  10  light  years  apart  and  move 
towards  each  other  at  10  km/sec  the  time  required  for  a  10  per  cent  energy 
increase  is  only  a  few  thousand  years. 

2.  SOURCES  OF  COSMIC  RADIATION 


We  have  till  now  discussed  the  accelerating  mechanism  in  quite  a  general 
manner.  The  actual  domain  of  storage  and  acceleration  as  we  have  mentioned 
already  depends  on  the  sources.  We  shall  now  take  up  the  discussion  of  the  sources 


1 
Solar  origin 

II 

III 

Source 

Galactic 

Extragalactic 

origin 

origin 

1.  Total  energy  of  the  radiation  (too  high  if 

+ 

^ . 

extra  galactic  in  its  origin). 

2.  Universal  occurrence  of  cosmic  radiation. 

+ 

-i- 

+ 

Other  celestial  bodies  can  also  produce 

• 

these  radiation. 

3.  Absence  of  Compton-Getting  effect.  (Due 

+ 

- 

— 

to  this  effect  an  isotropic  cosmic  radiation 

in  space  will  cause  higher  intensity  to  be 

observed  in  the  apex  of  the  earth  than  in 

the  anti-apex.) 

4.  Correlation  of  intensity  with  solar  flares 

+ 

— 

— 

argues  for  incident  radiation  from  the  sun. 

5.  No  day  and  night  effect  in  the  intensity 

— 

A- 

+ 

of  these  rays  precludes  the  solar  origin. 

6.  Presence  of  any  11  year  effect. 

+ 

- 

- 

7.  The  theory  of  Fermi  which  requires  ex- 

— 

+ 

+ 

tensive  magnetic  fields  and  dust  clouds 

argues  only  for  a  galactic  origin. 

8.  Absence  of  any  directional  effect  found  in 

— 

-f 

-1- 

the  continuous  radiation  is  unfavourable 

for  the  idea  of  a  solar  origin. 

9.  Directional  effect  of  high  energy  particles 

- 

+ 

+ 

pointing  to  points  in  galaxy  from  which 

radio  frequency  waves  arrive,  argues  for 

a  galactic  origin. 

10.  Presence  of  high  energy  heavy  particles 

— 

+ 

+ 

can  be  produced  only  in  the  favourable 

conditions  obtaining  in  supemovae  and 

so  is  an  argument  against  solar  origin. 

11.  Presence  of  Li,  Be,  B. 
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of  cosmic  radiation.  The  three  possible  origins  of  cosmic  rays  are  (1)  solar  origin, 
(2)  galactic  origin  and  (3)  extragalactic  origin.  The  table  given  above  due  to 
Clay  argues  for  the  greatest  possibility  for  a  continuous  radiation  of  galactic 
origin  with  the  superposed  radiation  from  the  sun. 


Solar  contribution  to  cosmic  radiation 

There  is  evidence  for  magnetic  fields  of  a  few  thousand  gauss  in  the  neighbour- 
hood of  sun  spots  which  presumably  extend  above  the  surface  of  the  sun  to  con- 
siderable heights.  At  these  heights,  the  energy  density  due  to  the  magnetic  field 
is  apt  to  be  considerably  greater  than  the  energy  density  present  as  hydrodynamic 
energy  and  thermal  energy  per  unit  volume  when  carried  by  the  material,  since 
the  density  of  the  material  is  only  10®  atoms  per  cm^.  Under  these  conditions 
the  motion  of  the  ionized  material  is  completely  determined  by  the  lines  of  force. 
The  energy  density,  pressiu*e  and  inertia  of  the  material  react  little  on  the  changes 
of  the  magnetic  fields.  If  there  is  any  motion  or  change  in  the  magnetic  field 
due  to,  say,  sun  spot  activity,  the  tenuous  material  of  the  corona  will  be  swept 
along  by  the  moving  magnetic  field.  In  particular,  rapidly  moving  magnetic  fields 
may  accelerate  matter  in  the  corona  to  a  sufficiently  high  velocity  and  this 
constitutes  the  injection  mechanism  in  the  Richtmyer-Teller^  theory.  We  now 
imagine  that  an  ionized  gas  is  accelerated  to  10®  eV/sec  and  a  fraction  of 
gas  finds  itself  at  the  end  of  the  acceleration  process  in  a  region  of  low  magnetic 
intensity.  If  now  Fermi  collisions  should  occur  with  wandering  magnetic  fields 
of  small  intensity  and  high  velocities,  particles  of  the  observed  energy  can  be 
produced.  For  this  phenomenon  to  occur  it  is  necessary  to  postulate  a  field  of 
10"^  gauss  confined  to  the  solar  system  which  is  necessary  to  trap  the  particles 
in  the  solar  system  while  they  are  being  energized  and  to  render  the  distribution 
isotropic.  Observations  have  shown  that  the  geomagnetic  field  and  other  fields  in 
our  system  deflect  the  cosmic  rays  by  ^/jo^^  of  a  radian  in  about  ten  earth  radii 
(for  a  typical  10  BeV  cosmic  ray).  The  field  turns  out  to  be  one  milHgauss  or 
a  little  less.  There  is  a  paucity  of  cosmic  rays  below  2  BeV  and  this  cut-off" 
varies  with  the  solar  activity  cycle  of  eleven  years  and  is  a  sign  of  the  influence  of 
the  magnetic  field  around  the  sun  on  cosmic  rays.  Alfven^  is  in  favour  of  the 
Richtmyer-Teller^  theory  and  supports  the  postulate  regarding  the  existence 
of  an  extensive  magnetic  field.  The  solar  dipole  is  insufficient  to  cause  such  a  field 
and  Alfven*  suggests  the  following  mechanism.  Suppose  that  in  an  electrically 
conducting  medium  with  the  mass  density  g  there  is  an  external  magnetic  field  H\ 
If  parts  of  the  medium  are  put  into  motion  with  velocity  v,  the  magnetic  field  Hq  is 
superimposed  on  the  initial  field.  In  the  case  of  an  infinite  electrical  conductivity, 


1  R.  D.  RicHTMYER  and  E.  Teller,  Phys.  Rev.  75,  1729  (1949). 

2  H.  Alfven,  loc.  cit.  (1950). 

^  R.  D.  RiCHTMYER  and  E.  Teller,  Ibid. 
*  Alfven,  loc.  cit.  (1950). 
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the  average  value  of  H'  increases  until  the  magnetic  energy  equals  the  kinetic 

energy;  — — =  \qv'^.  When  this  limit  is  reached  an  equipartition  of  energy  is 

accomplished  and  the  magnetic  field  energy  is  converted  back  into  kinetic  energy. 
The  state  characterizing  a  magneto  hydrodynamic  wave  is  reached.  We  shall 
apply  these  results  to  interstellar  spaces  around  the  solar  system.  Setting 
Q  =  10~2*  gm  cm"^  which  is  the  usually  accepted  value  of  interstellar  density, 
we  have  H'  =  v(4i7iQf^^=  3.5  x  lO'^'^v.  A  magnetic  storm  is  usually  supposed 
to  be  produced  by  a  corpuscular  beam  sent  out  radially  from  the  sun  with  a  velo- 
city of  the  order  2  x  10^  cm  per  sec.  When  it  passes  through  the  solar  magnetic 
field,  the  beam  will  distort  the  magnetic  lines  of  force  and  will  cause  an  induced 
field  which  will  grow  up  to  the  order  of  H'  =  3.5  x  IQ-^^  x  2  x  10^  -  7  x  10"* 
gauss.  It  should  also  be  observed  that  the  beam  consists  of  ionized  gas  which  has 
a  high  conductivity.  The  magnetic  field  in  the  region  close  to  the  sun  where  the 
beam  emanates  may  in  part  be  frozen  into  the  matter  of  the  stream.  In  this  way, 
the  magnetic  field  is  transported  outwards.  Both  these  effects  produce  magnetic 
fields  in  the  outskirts  of  the  solar  dipole  field  which  are  much  stronger  than  the 
dipole  field  itself.  Considerations  of  the  decay  of  these  fields  and  the  total  energy 
necessary  to  produce  fields  of  an  extension  and  strength  sufficient  to  trap  cosmic 
radiation  of  varying  energy  however  make  it  rather  dubious  whether  this  effect 
is  sufficient.  The  solar  origin  of  cosmic  rays  might  seem  attractive  for  the  low 
energy  component  because  only  a  very  small  fraction  of  the  sun's  energy  would 
be  required  to  maintain  the  cosmic  radiation.  On  the  other  hand  the  solar  hjrpo- 
thesis  runs  into  difficulties  when  trying  to  account  for  the  cosmic  ray  particles 
of  high  energies.  Particles  with  more  than  10^*  eV  per  nucleon  have  been  observed, 
and  if  such  particles  are  assumed  to  have  been  accelerated  on  the  solar  surface, 
fields  of  1000  gauss  extending  over  at  least  3  x  10®  cm  will  be  needed  to  confine 
them.  In  the  latter  phase  when  these  particles  circulate  in  interplanetary  space 
fields  of  10"^  gauss  extending  over  at  least  3  x  10^^  cm  or  10^  light  seconds  will 
be  needed.  A  further  argument  against  the  solar  origin  for  the  high  energy  com- 
ponent is  the  fact  that  the  additional  effects  which  accompany  solar  outbursts 
are  considerably  less  energetic  than  the  steady  component  of  cosmic  radiation. 
Moreover  the  sun  makes  very  few  cosmic  ray  particles  above  10  BeV  even  in  its 
most  violent  spasms.  It  is  appropriate  to  look  farther  afield  for  the  origin  of 
high  energy  cosmic  rays.  The  solar  disturbances  which  coincide  with  the  intensity 
increases  in  cosmic  rays  are  also  associated  with  the  emission  of  radio  waves  of 
a  few  metres  in  wavelength.  The  sun  however  is  not  the  only  celestial  body 
which  emits  such  radio  waves.  Definite  positions  in  the  sky  called  ''radio  stars" 
emit  radiation  similar  to  that  which  is  emitted  by  active  spots  on  the  sun.  It 
is  reasonable  to  suggest  that  the  radio  stars  in  addition  to  emitting  radio  waves 
should  also  produce  energetic  particles  similar  to  those  which  the  disturbed  sun 
emits.  The  greater  average  radio  intensity  of  the  stars  might  give  reason  to  expect 
a  greater  flux  of  cosmic  ray  particles  arriving  from  them.  Thus  it  seems  likely 
that  the  high  energy  component  of  cosmic  radiation  is  due  to  phenomena  similar 
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to  solar  magnetic  activities,  but  actually  taking  place  in  distant  astronomical 
objects. 

Galactic  sources  ot  cosmic  rays^ 

Conclusions  based  on  the  distribution  of  the  electronic  component  of  cosmic 
radiation  at  the  source  by  a  stud}-  of  cosmic  radio-emission  will  form  a  valuable 
supplement  to  our  knowledge  of  the  composition  of  the  primary  cosmic  radiation 
in  the  vicinity  of  the  earth.  Cosmic  radio-emission  consists  of  a  component  from 
discrete  sources  (radio-nebulae)  and  of  a  general  emission  of  galactic  origin  con- 
taining both  thermal  and  non-thermal  components.  Evidence  for  the  non-thermal 
component  comes  from  the  fact  that  if  we  assume  that  the  effective  temperature 
TfQ  of  thermal  radiation  cannot  exceed  the  kinetic  energy  of  interstellar  gas 
(which  even  in  the  completely  ionized  region  never  exceeds  10^  degrees)  we  should 
have  for  T^q  a  value  very  much  less  than  lO"*  degrees.  For,  in  the  region  of  the 
galactic  centre  or  the  galactic  pole  (where  the  optical  thickness  of  the  gas  is  small) 
actually  T^q  -  2.7  x  10^  degrees  for  A  -  16.3  m  and  lOMegrees  for  A  =  32.8  m  etc. 
in  the  region  of  the  galactic  centre  and  T^q  ^  7.5  x  10*  degrees  for  A  =  16.3  m  in 
the  region  of  the  galactic  pole  (A  being  the  wavelength  of  the  radiation).  Hence 
there  should  be  a  non-thermal  galactic  radio-emission,  the  sources  for  which  (from 
the  analysis  ofradio-isophotes)  should  form  a  quasi- spherical  sub-system  of  radius 
B^3.5x  10-2  cm  which  is  approximately  the  spatial  distribution  of  diffuse 
interstellar  gas  at  a  very  low  concentration  {n  ^  0.1  cm-^). 

The  non-thermal  component  can  be  well  accounted  for  on  the  basis  of  the 
"magnetic  bremsstrahlung  hypothesis"  according  to  which  this  part  of  the  cosmic 
radio-emission  is  due  to  the  radiation  by  relativistic  electrons  accelerated  in  inter- 
stellar magnetic  fields.  A  relativistic  electron  of  energy  E  following  a  helical 
path  in  a  magnetic  field  H  with  a  frequency 

eH    mc^ 
mc      E 

will  emit  radiation  almost  exclusively  into  a  narrow  cone  of  apertures  ^  — ^^—  <  1 . 

Hence,  if  the  electron  moves  in  a  circle,  an  observer  in  the  plane  of  the  orbit 
will  observe  pulses  of  radiation  of  period 

where  r  = is  the  radius  of  the  orbit  and  the  factor  ( — ^7^-  I   is  due  to  the 

oj„  \    E    I 

Doppler  effect.  If  the  electron  moves  helically  and  the  angle  between  its  velocity 

and  the  magnetic  is  not  too  small,  6  ^  oc  ^  mc^jE  and  we  have 

.  mc    I  mc^^ 

At  ^— --(---]  (3) 


mc    /  mc^  Y 


^    V.  I.  GiNZBURG,  loc.  Cit. 
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where  H ^  is  the  component  of  the  magnetic  field  normal  to  the  direction  of  motion. 
The  radiation  thus  consists  of  a  continuous  spectrum  consisting  of  harmonics  of 
frequency  Wji  with  a  maximum  at  the  frequency 


1         e^ 


(AT- 


(S) 


More  detailed  calculations  of  the  energy  radiated  by  an  electron  per  second 

dco 
in  a  frequency  range  dv  =  -^ —  viz.,  P{v,  E)dv  have  been  done.  The  expression 

for  P{v,  E)  is  of  the  form 

Piv,E)=^p{v)  =  m^^J^] 

At  maximum  frequencj^ 

i'(^max)  =    ^'^^^f^   =  2.16  X  10-22f^^  erg  sec- 1  cycle- 1.  (6) 

I'max  =  0.5  wJ2  7i=\AxW  H^  {E^jm  c^f .  (7) 

The  intensity  of  magnetic  bremsstrahlung  emission  observed  at  the  earth  is 

/,  =  -^-  fp{v,  E)  N,{E,  r)  dEdr  (8) 

where  Ng(E,  r)  is  the  differential  electron  spectrum  at  the  point  r.  If  we  assume 
that  all  the  electrons  have  a  common  energy  E^  we  have  for  the  maximum 
intensity  of  the  radiation 


e 


4:71 

-  1.7x  10-23  ^^^^^i?  erg  cm- 2  sec- isterad-1  cycle- 1  (9) 

where  N^  and  Hj^  are  average  values  along  the  line  of  sight  and  R  is  the  extent 
of  the  emitting  region  in  that  direction. 

For  thermal  emission  in  the  direction  of  the  galactic  pole 

iv  = ;^^  ^  eff  = -p- ^  eff  U") 


where  T^g  -  7.5  x  lO'*  degrees  for  A  =-  16.3.  Hence 

2kT^ 
A21.7X  10-23 


Ne  H^R=    .,,„..  :°_,,   ^  4.6  X  W  (11) 


As  discussed  earlier  in  this  chapter,  there  is  reason  to  believe  that  the  intensitj^ 
of  interstellar  magnetic  fields  is  of  the  order  3  x  10-^  —  10"^  gauss.  Using 
H j^  ^  10"^  gauss  and  i?  ^  5  x  10^^  cm  (for  the  galactic  "corona")  we  get 

iV,^  10-12  cm-3  (12) 

and  /  ^  \i/, 
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wliich  is  in  agreement  with  the  data  on  the  electronic  component  of  cosmic 
radiation  near  the  earth.  Thus  the  magnetic  bremsstrahlung  hypothesis  seems 
to  be  in  good  agreement  with  experimental  facts.  Further  if  we  assume  a  power 
law  spectrum  for  Nf(E)  we  infer  that  the  emission  at  A  =  16.3  m  is  determined 
mainly  by  electrons  with  energies  in  the  range  10®  <  ^  <  6  X  10®  eV  if  the  valuo 
for  the  exponent  y  is  taken  as  2.64  which  is  in  agreement  with  the  data  of 
radio-astronomy. 

Thus  the  above  considerations  lead  to  the  conclusion  that  the  galaxy  can  be 
thought  of  as  a  spherical  container  for  cosmic  radiation  of  radius  i2  ==  3.5  x  10^2  cm 
and  with  walls  that  are  practically  perfect  reflectors.  That  the  assumption  about 
perfect  reflection  at  the  boundary  is  reasonable  can  be  seen  if  we  consider  the 

approximate  equality  ^gu^  a^f-z—H^  where  q  is  the  density  of  the  medium  and  u 

its  velocity.  For  the  galactic  "corona"  q  ^  10~26-l0-2«  g  cm^  while  for  inter- 
galactic  space,  Q  ^  10"2»-l0-30g  qjj^2  which  would  give  H  ^  1-3  x  10"'' gauss. 
Thus  the  properties  of  the  galactic  "corona"  and  the  intergalactic  medium  are 
very  different  and  the  difference  in  their  magnetic  field  strengths  will  alone  be 
enough  to  ensure  that  the  galaxy  reflects  all  particles  coming  from  outside  it. 
The  total  energy  of  cosmic  radiation  in  the  galaxy   Wc.r  will  be  equal  to 

^'cr  X  "^^-R*  where  w^.r^  leVcm~^  for  protons  (it  will  be  an  order  10 "^  less 

for  electrons).  Thus 

W,.r  ^  10«'  —  10«8  eV  ^  10^5  —  105«  ergs.  (14) 

The  radio-emissions  from  nebulae  such  as  Cassiopeia-A  and  the  Crab  nebula  can 
be  similarly  considered  to  be  of  magnetic  bremsstrahlung  origin. 

A  further  confirmation  for  the  hypothesis  comes  from  the  data  on  the  polari- 
zation of  the  optical  spectrum  of  both  galactic  (e.g.  the  Crab)  or  extragalactic 
(e.g.  Virgo.  A)  nebulae.  Only  the  magnetic  bremsstrahlung  mechanism  can  explain 
a  strong  polarization  since  the  emission  from  relativistic  electrons  will  be  strongly 
polarized. 

Thus  the  radio-astronomical  evidence  points  to  a  galactic  origin  of  cosmic  rays 
and  the  presence  of  cosmic  electrons  throughout  the  galaxy  is  in  complete 
contradiction  with  theories  of  an  exclusively  solar  origin  of  cosmic  radiation. 
We  shall  now  consider  some  particular  examples  of  possible  sources. 

Cassiopeia-A  and  Taurus-A  (the  Crab  nebula)  which  are,  as  already  mentioned, 
sources  of  powerful  non-thermal  radio -emission  are  the  expanding  envelopes 
formed  during  the  explosions  of  supernovae  of  the  years  369  and  1054.  The  en- 
velopes of  other  supernovae  also  emit  such  radiation  so  that  radio-astronomy 
gives  us  the  important  result  that  the  explosions  of  supernovae  lead  to  an  abundant 
production  of  relativistic  electrons.  Calculations  (e.g.  Oort  and  Walraven^)  show 
that  the  energy  spectrum  of  these  electrons  ranges  from  y  ^  3.5  to  y  '=^  1.5. 
The  total  energy  of  the  electrons  varies  between  10*^-10*®  ergs. 


^  J.  H.  Oort  and  I.  Walraven,  Bull.  Nether.  Astron.  Inst.  12,  285  (1956). 
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It  is  natural  to  postulate  that  relativistic  electrons  are  also  produced  from  ex- 
plosions of  novae  although  these  electrons  will  have  energies  of  a  smaller  order 
(10*2-10**  ergs)  since  the  novae  have  an  energy  output  10^-10*  times  smaller  than 
that  of  supernovae.  As  supernovae  do  not  erupt  in  our  galaxy  oftener  than  once 
in  30  years  (10^  sec),  the  rate  of  generation  of  relativistic  electrons  from  them 
will  be  _ 

Ue.sn  =  —-  ^ ^^ ^  10^6  -  10^9  erg  sec-i.  (15) 

For  novae,  for  which  the  period  t„  is  0.01  year, 

C/e,«  =  3  X  lO^^-lO^s  erg  sec-1.  (16) 

Now,  on  the  sun,  particles  of  energy  of  the  order  of  several  BeV  are  formed  on 
the  average  at  a  rate  10^^-10^*  sec~^  and  in  spite  of  the  10^^  stars  of  our  galactic 
system  emitting  radiation  of  the  same  power,  the  rate  would  be  only 
10^2-10^^  erg  sec"i.  The  mechanism  of  loss  of  energy  by  protons  and  nuclei  in  the 
interstellar  medium  is  mainly  by  nuclear  collision.  (For  protons  the  lifetime  is 
Tp^Qx  10®  years  =  1.9  x  10^^  sec.)  Hence  assuming  that  the  protons  carry 
a  good  percentage  of  the  total  cosmic  ray  energy  W^.r  in  the  galaxy  we  get  for 
the  rate  of  loss  energy  by  cosmic  rays  in  the  galaxy 

W 
U^,,  =  — f^  ^  W^  -  10*0  erg  sec- 1.  (17) 

^  p 

As  we  saw,  the  loss  of  energy  by  the  electrons  in  interstellar  space  is  mainly  by 
magnetic  bremsstrahlung  and  a  calculation  on  the  basis  that  electrons  carry  the 
bulk  of  the  energy  of  cosmic  rays  gives 

C7e  =  103«-103»  erg  sec-i.  (18) 

The  above  facts  conclusively  show  that  only  the  supernovae  origin  of  cosmic 
rays  is  in  consonance  with  such  high  rates  of  energy  loss  from  the  galaxy  since 
they  are  the  only  possible  sources  of  injection  into  the  galaxy  of  cosmic  electrons 
at  a  power  of  10^^-10^^  erg  sec"^.  There  is  no  direct  evidence  about  the  transfer 
of  energy  to  protons  and  other  nuclei  during  these  explosions,  but  it  is  extremely 
probable  that  energy  is  transferred  to  them  in  the  same  manner  and  quantity 
as  to  the  electrons. 

The  above  conclusions  are  not  dependent  on  or  influenced  by  the  mechanism 
of  particle  acceleration  which  by  itself  forms  an  independent  and  important 
problem.  There  are  reasons  to  believe  that  the  Fermi  mode  of  acceleration  may 
not  be  very  effective  in  interstellar  space.  In  the  first  place,  the  movement  of  the 
interstellar  medium  is  relatively  slow  {u  ^  10'  cm  sec"^)  and  secondly  the  free 
path  A  is  very  great  (^  3  x  10^?  cm).  Now  the  coefficient  oc  which  determines  the 
rate  of  mean  particle  energy  increase  in  the  equation 

~=-ocE  (19) 

at 
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is  equal  to     g.    where  v  is  the  velocity  of  the  particle  and  hence  is  small  in 

C  A 

interstellar  space.  In  the  envelopes  of  supernovae,  velocities  of  the  order  of 
1-3  X  10®  cm  sec"^  are  to  be  found  and  in  the  early  stages  of  turbulence,  A  ^  10^^  cm. 
ThuS(X  will  have  a  value  very  much  higher.  Assimiption  of  isotropic  turbulence  in 
the  envelope  leads  to  values  of  a  ^  10®.  Statistical  acceleration  will  be  extremely 
effective  in  the  explosion  process  at  the  centres  of  the  supernovae  themselves 
before  the  envelope  has  separated  from  it.  Further  evidence  for  this  idea  comes 
from  the  fact  that  under  statistical  acceleration  the  energy  reached  by  a  particle  is 
proportional  to  its  mass  (1-22 a)  and  thus  protons,  e.g.  will  be  accelerated  to  an 
energy  about  1840  times  that  of  the  electrons.  (This  may  be  even  higher  due  to 
magnetic  bremsstrahlung  losses  with  the  increase  of  energy  of  electrons.)  Calcu- 
lations derived  on  this  basis  give  an  energy  spectrum  which  to  the  accuracy  at 
present  available,  corresponds  to  the  actual  spectrum  of  cosmic  radiation.  The 
value  for  the  total  energy  of  cosmic  radiation  in  the  galaxy 

W,.r--  10«M06«ergs  (20) 

can  be  accounted  for  by  assuming  that  the  resultant  average  spectrum  of  ail 
cosmic  radiation  is  the  sum  of  the  spectra  of  cosmic  particles  accelerated  in  the 
eruptions  of  about  10^  "10^  supernovae  (the  total  energy  of  relativistic  electrons 
in  supernovae  is  10*^-10^®  ergs). 

The  supernovae  origin  of  cosmic  radiation  is  also  confirmed  by  the  composition 
of  cosmic  radiation  near  the  earth.  The  estimation  of  the  latter  requires  the  use 
of  the  system  of  equations 

^^F{D,FN,)-^  +  2:Pii^  +  ^i,  .(21) 

where  Ni(r,  t)  is  the  concentration  of  nuclei  of  species  i,  qi(r,  t)  is  the  number  of 
nuclei  of  the  same  kind  formed  at  the  source  per  imit  volume  per  unit  time, 
Di{r)  is  the  coefficient  of  diffusion  and  Ti(r)  is  the  lifetime  of  nuclei  of  species  i 
until  they  are  broken  up  by  nuclear  collision  in  interstellar  space ;  Pi^  is  the  pro- 
bability of  formation  of  a  nucleus  of  species  *  as  a  result  of  the  disintegration  of  a 
nucleus  of  type  ;. 

The  above  system  of  equations  have  been  solved  subject  to  various  assumptions 
such  as 

(a)  the  structure  of  the  galaxy  does  not  change  during  the  lifetime  of  cosmic 

/  dNi         \ 
particles  ( —^rf~  =  0 1 

(b)  the  sources  are  near  the  galactic  plane  (as  is  actually  the  case  for  novae 
and  probably  so  for  supernovae). 

We  present  here  only  the  important  results  which  are : 

(i)  The  distance  of  the  source  from  the  earth  r  =  1.8  x  lO^^  cm  which  is  only 
1.4  times  smaller  than  the  actual  distance  of  the  earth  from  the  centre  of  the 
galaxy. 
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(ii)  Assuming  a  proper  distribution  of  sources  together  with  the  diffusion  of 
cosmic  particles  the  relatively  small  amounts  of  the  elements  Li,  Be  and  B  near 
the  earth  can  be  explained  and  further  in  order  to  establish  a  connection  with 
the  data  on  the  composition  of  the  radiation  in  the  vicinity  of  the  earth  we  have 
only  to  assume  that  the  relative  number  of  medium  and  heavy  nuclei  in  the  sources 
are  from  five  to  ten  times  higher  than  the  universal  average.  Such  an  assumption  is 
in  agreement  with  recent  pictures  of  the  constitution  of  supernovae.  The  solu- 
tions of  the  diffusion  equations  mentioned  above  determine  also  the  distribution 
of  cosmic  radiation  in  the  galaxy  and  hence  can  be  used  to  evaluate  the  degree 
of  anisotropy  6.  The  result  is  that  one  can  achieve  d  <  10~^  in  agreement  with 
experiment  for  particles  with  E  <  10^  eV  and  a  value  6  ^  10"^  for  nuclei  with 
E  ^  10^8  eV. 

Thus  we  may  conclude  that  the  picture  of  the  origin  of  cosmic  radiation 
developed  here  with  the  envelopes  of  novae  and  supernovae  as  the  sources  of 
cosmic  radiation  seems  to  be  in  consonance  with  the  experimental  results  on 
the  energy  spectrum,  the  composition  and  the  isotropy  of  cosmic  radiation. 

There  is  always  the  possibility  of  considering  even  extragalactic  sources  of 
cosmic  radiation  but  in  such  a  case  the  storage  space  must  be  the  intergalactic 
space  and  the  acceleration  must  also  take  place  there.  Since  the  storage  space 
is  large,  very  weak  magnetic  fields  are  sufficient  to  confine  the  particles  in  it. 
The  extremely  high  energy  spectrum  may  perhaps  be  traced  to  such  sources 
(e.g.  M  87). 
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APPENDICES 


1.  DIMENSIONS  AND  UNITS 


It  is  well-known  that  the  dimensions  of  various  physical  quantities  could  be 
expressed  in  terms  of  a  chosen  few  (L,  M,  T-length,  mass  and  time  in  mechanics) 
which  are  treated  as  fundamental  dimensions.  This  choice  of  fundamental  dimen- 
sions is  quite  arbitrary. 

In  the  study  of  elementary  particles  it  is  convenient  to  choose  velocity  V  and 
action  A  as  fundamental  dimensions,  the  unit  being  the  velocity  of  light  c  and  the 

h 
Planck's  constant  divided  by  27c,  i.e.- —  =  h  respectively i.  This  choice  is  called  the 

system  of  natural  units  and  is  usefully  employed  in  the  algebraic  calculations  of 
quantum  theory  so  that  one  need  not  write  the  often  occurring  c  and  h  throughout 
a  calculation. 

The  definition  of  this  system  of  units  will  be  complete  if  another  dimension 
and  its  unit  are  specified.  This  choice  is  made  to  suit  the  problem  under  investi- 


Dimensions 

Conversion  factor 
(C.  G.  S./natural) 

Physical 

h  =  c=l 

h  =  c--- 

1  ^^1 

^  =  1 

Quantity 

in  terms 

in  terms 

in  terms 

in  terms 

c=l 

c=l 

oi{L,M,T) 

if  {A,  V,M) 

of{A,V,L) 

o{{A,V,T) 

m^l 
system  i 

Igm 
system 

1  cm    1  sec 
system  system 

Mass 

M 

M 

AL-^V-^ 

AV'T-^ 

m 

1 

h/c 

h/c^ 

Length 

L 

AV-^M-^ 

L 

VT 

h/mc 

h 

c. 

1 

c 

(Compton 
wave- 
length) 

1 

Time 

T 

AV-'M-' 

L  F-i 

T 

h/m  cr- 

1 

Velocity 

LT-^ 

V 

V 

V 

c 

c 

c 

0 

h 

c 

Momentum 

MLT-^ 

M  V 

AL-^ 

AV-^T-^ 

m  c- 

c 

h 

Energy 

ML-^T-^ 

M  V 

AL-^  V 

A  y-i 

m2  r;2 

he 

h 

Cross-section 

U- 

A'V-'M- 

'■         L' 

yz  ji-i 

1 

c' 

Charge 

K-'I^M'I.L'l-z 

K-^ 

A^ 

vi 

^c 

he 

^c 

he 

1  In  C.G.S.  units, 
C  =  2.99793  X  lO^o    cm. sec. 


1 .05443  X  10-  27    erg.  —  sec. 
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e  =  4.80286  x  10"  i»    e.s.u. 
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gation.  This  dimension  may  be  taken  as  mass.  Usually  in  meson  theoretical 
calculations  the  jr-meson  mass  is  taken  as  unity,  while  in  a  purely  quantum 
electrodynamical  calculation,  the  electron  mass  could  with  advantage  be  taken 
as  unity  or  as  in  mechanics  one  gram  could  taken  as  the  unit.  Some  authors  take 
the  length  or  time  as  the  third  dimension  with  one  centimetre  or  one  second 
as  the  unit  respectively. 

While  comparing  with  experiments  however  one  has  to  come  back  to  the  C.G.S. 
system  with  units  of  LMT  being  1  cm,  1  g  and  1  sec  respectively  and  this  means  a 
proper  replacement  of  c's'and  ^'sin  the  final  expressions  of  a  theoretical  calculation . 
This  can  be  achieved  by  elementary  dimensional  analysis  and  the  table  given  above 
furnishes  the  various  conversion  factors. 

LMT  stand  as  usual,  for  Length,  Mass  and  Time  while  A  and  V  denote 
Action  and  Velocity  respectively.  A  =  ML^T-^,   V  =  LT-^. 

In  appljring  the  conversion  factors  to  the  final  expressions  (to  convert  into 
C.G.S.  units)  one  has  to  make  sure  that  all  the  quantities  occurring  in  the  expression 
are  already  expressed  in  natural  units.  For  example,  in  an  electro-dj^amical 
calculation  the  charge  e  occurring  in  a  final  expression  for  cross-section  has  to  be 
expressed  in  natural  units  before  applying  the  conversion  factor  for  the  cross- 
section. 

It  is  sometimes  said  that  in  the  natural  system  of  units,  all  physical  quantities 
have  dimensions  of  powers  oiL  only.  This  statement  refers  to  the  ^  =  1,  c  =  1  and 
L  system  and  is  understood  if  we  allow  the  ^'s  and  c's  in  the  conversion  factors 
to  carry  with  them  their  dimensions.  In  this  system  we  have: 


Physical  quantity 

Dimension 

Unit 

Mass 

Time 

Velocity 

Energy 

Action 

L 
W  (Dimensionlcss) 

L^  (Dimensionless) 

h  c- 1  cm-  1 
c~^  cm 

c 

h  ccm-1 

h 

The  ^'s  and  c's  in  the  last  column  are  treated  as  dimensionless.  From  this  point  of 
view,  all  quantities  become  dimensionless  in  the  h=l,  c=l,  m=l  system^ 
excepting  for  electromagnetic  quantities  like  charge  which  carry  a  dimension  in  K, 

2.  COMPOSITION  OF  ANGULAR  MOMENTA 

Addition  of  two  angular  momenta — Clebsch— Gordan  coefficients  ^ 

In  many  physical  problems  there  arise  situations  where  two  (or  more)  angular 
momenta  have  to  be  coupled.  Examples  are  the  orbital  and  intrinsic  spin  angular 
momenta  of  a  single  particle,  the  angular  momenta  of  the  particles  of  a  system 
or  the  angular  momenta  of  a  particle  and  the  radiation  it  emits  or  absorbs. 


^  A.  R.  Edmonds,    Angular  Momentum   in  Quantum   Mechanics,   Princeton    University 
Press  (1957).  See  also  Elementary  Theory  of  Angular  Momentum  hyM.  E.  Rose,  John  WUey 

(1957). 
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Consider  a  system  in  which  J^  and  J^  are  two  commuting  angular  momentum 
operators.  The  resultant  angular  momentum  is  given  by 

J=J,+J2.  (1) 

The  state  of  the  system  can  be  represented  in  two  distinct  schemes  for  there 
are  two  sets  of  four  mutually  commuting  observables 

(A)  J?,Ji,/,„J2z 

(B)  JIJIJ^J,.  ^  ^ 

The  simultaneous  eigenfimction  in  the  representation  (A)  in  which  J^,  J\,  J^^ 
and  t/22  ^re  diagonal  can  be  written  as  the  direct  product  \pf^  y)j^'  where  y)f^  is 
the  simultaneous  eigenfunction  of  J^  and  J^^  and  ^J^«  that  of  J|  and  J^^.  If  the 
eigenfunctions  in  the  representation  (B)  in  which  j^ ,  J| ,  J^  and  J^  are  diagonal 
are  wTitten  as  y)f  (/^  j^)  they  can  be  expanded  in  terms  of  the  eigenfimctions  of 
the  other  representation  as 

WT ih h)  =  ZC ih h 7 ;  ^1  ^2 m)  xpl^ xpf^^  (3) 

where  C(j^  j^  j ;  m^  m^  m)  is  called  the  vector  addition  or  Clebsch-Gordan  or  Wigner 
coefficient :  Since  J  ^  J^  +  J^  the  Clebsch-Gordan  coefficient  vanishes  unless 
7n  =  Trii  +  m^.  The  maximum  value  which  /  can  assume  is  obviously  the  maximum 
value  of  m  which  is  j^  +  j^,  i.e.  the  sum  of  the  maximum  Values  of  m^  and  m^ 
respectively.  Thus  ipj^  is  an  element  of  a  column  vector  consisting  of  (2^^  +1) 
(2/2  +  1)  elements.  Thus  the  above  equation  (3)  can  be  written  as 

W(B)=^C^{A),  (4) 

where  ^(A)  and  ^(B)  are  column  vectors  with  typical  elements  as  eigenfimctions 
in  the  schemes  A  and  B  respectively.  We  must  prescribe  a  method  of  writing 
!^(A)  and  ^(B)  explicitly,  and  this  determines  the  way  in  which  the  matrix  C 
has  to  be  written.  The  column  vector  ^'(B)  is  written  by  varying  j  from  |  /^  —  j^  | 
to  ji  +  ^2  and  exhausting  all  the  values  of  m,  i.e.  — /  to  +/  for  each  value  of  ; 
before  going  from  one  value  of  j  to  another.  !f^(A)  is  obtained  by  running  over 
all  the  values  of  m^  for  a  given  mg  as  mg  varies  from  — /g  to  -i-j^.  This  scheme 
fixes  the  method  of  labelling  the  elements  of  C,  i.e.  jm  represent  the  indices  for 
the  column  and  mj^m^  for  the  row.  Since  C  is  unitary  and  if  we  choose  its  elements 
to  be  real  C-^  =  C^  and  the  inverse  transformation  is  given  by 

y,j:-  rT.'  =  ^C(j,j^r,  m.m,m)vf .  (5) 

jm 

It  is  to  be  noted  that  in  this  case  jm  fixes  the  row.  The  unitarity  condition  implies 
U  G 0*1  ?2  j \m^m^m)'C (;,  j^  j' ;  m^ m^ m')  =  d^y  dmm'         .  (6) 

Z  C  iji  J2  j ;  ^1  rn^  w )  C iji  J2  j ;  m[  m'^  m)  =  dm, m[  K, m[  (7)   : 

jm 

with  m  =  m^-\-  m2^  and  m'  =  Wj  -f  77?2  • 
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Computation  of  Clebsch— Goidan  coefficients^ 

We  can  obtain  a  set  of  recursion  relations  which  completely  determine  the 
Clebsch-Gordan  coefficients  with  the  proper  phases.  Knowing  the  state  ipj^,  i.e. 
all  the  coefficients  C{j^j^j;  m^m^m)  for  the  given  j  and  m  we  can  obtain  the 
state  xpf'^  by  using  the  well-known  relation 

J-  y>T  =  MW  +  'm){j-m  +  1)] .  v)f -1,  (8) 

where 

J-  =  Jx—  ^Jy  =  ^^\x  —  ^J\y)  +  {J%x  —  ^J^y)  =  ^i-  +  ^2-  •  (9) 

Expanding  (8)  in  the  scheme  A  and  using  the  relations,  /_  =  J^_  +  Jg-  we 
obtain  {Ji-  +  J2-)S  WT:  Wl'  C ih  k  J ;  ^1  ^2 ^) 

nil  m2 
nil  i^i 
mim., 

=  2  ]/M  +  m)  (/  -  m  +  1)]  ^5^'^  y^t^-^x-i  C  ij^  J2  j:  m^m-m^-l,  m-l).       (10) 

Wil  Wis 

Equating  the  coefficients  of  ^^^^  ^J"*  on  both  sides 

[{j  +  m){j  -m-^  1)]V2 C  (/i ^  j\  m^m^m—\) 
=  [O'l  +  Wi  +  1)  (;i  -  m^)]V.  C (/i  ^2  j;  rrii+lm^ m) 

+  [0*2  +  m  -  mj  (;2  -  m  +  Wj  +  1)]'^^  C{k  h  j\  m^m^^  \m) .  (11) 

This  is  the  recursion  relation  which  steps  down  the  value  of  m.  Similarly  by 
applying  J+  to  yf-^  we  can  obtain  the  following  recursion  relation  which  steps  up 
the  value  of  m  by  unity 

[{j  -m){j  +  m-\-  l)p C{j^ 72 /;  m^ m^m+l) 

=  [ih  -  ^1  +  1 )  iji  +  mi)JI-  C {ji k  j;  rn^-l,  rn^,  rn) 

+  [ih  —  m  +  m^)  {J2  +  m  -  mi  +  1)]'/^  C  {j^  j.^  j ;  m^  m.^  m) .       (12) 

We  can  also  obtain  the  recursion  relation  which  steps  down  the  value  of  j.  As  the 
derivation  is  elaborate  we  give  here  only  the  result: 


m.  —  m- 


Gijihj'y  rriim^m) 


if  -  m2)  {j  -  j^  +  j^)  {j  +  /i  -  j^)  (j^  +  h  +  j+l)  ih  +  ?2  -  ?  +  1) 


4f{2j-l){2j+l) 

C{jihj-  1;  m^m^m) 
{{j  +  1)^  -  m^](;  +  1  -  ?!  +  H)  (;  +  1  +  h  -  h)  ih  +  J2  4-  /  +  2)  (y\  -f-  ;2  -  /)  l^^- 
Mj+l)H2j+l){2j^3) 

Ciji  J2  ?  +  1 ;  mj  mg  m) .         (13) 


1  E.  U.  Condon  and  G.  H.  Shortley,  ''The  Theory  of  Atomic  Spectra'',  Cambridge  Uni- 
versity Press  (1951). 
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The  recursion  relation  (13)  gives  us  the  state  yj"_j  when  we  know  the  state  ^J" 

Starting  with  the  state  of  highest  /  and  m,  i.e.  V'r-jJV ji*  ^^^  which  there  is  only 
one  non-vanishing  coefficient  namely 

<^Uih7  =  Ji  +  h'^  Wi  =  h .  ^2  =  h'y  ^  -=  h  +  ;2)  =  1  (14) 

we  can  obtain  all  the  elements  of  the  matrix  C  using  the  recursion  formulae  (11) 
and  (13).  Since  the  last  element,  i.e.  the  element  corresponding  to  the  last  row 
and  last  column  is  unity  we  can  obtain  every  element  of  the  last  row  by  usmg 
the  recursion  relation  (11).  Once  the  last  row  is  obtained  the  next  row  can  be 
determined  by  the  use  of  the  recursion  relation  (13)  and  by  repeated  applications 
of  these  relations  all  the  elements  of  the  matrix  can  be  obtained. 

The  general  formula  for  the  C- coefficients  was  first  given  by  Wigner  by  the  use 
of  group  theoretical  methods.  His  result  is 

a  +  ii  -  h)Hj  -  h  +  ?2)  U?i  +  J2  -  ?)Hj  +  m)\{j  -m)\{2j+  1) 


y 


(?  +  /,  +  ;2  +  1) !  iji  -  nil) !  (;i  +  Wj) !  {j^  -  m^) !  {j^  +  m^) 


^2 


ii  -  h  +  J2  —  v)\{j  ■\-  m  —  v)\  v\{v  -\-  ji  -  J2  -  m) 


(15) 


In  this  summation  v  takes  all  integral  values  such  that  none  of  the  arguments 
for  the  factorials  becomes  negative. 

A  number  of  derivations  of  the  general  formula  are  now  available  notable  of 
which  are  that  of  Racah  and  of  Schwinger.  The  general  formula  is  so  complex 
that  the  calculation  of  coefficients  by  its  use  is  as  tedious  as  the  direct  use  of 
the  recursion  relations.  However  it  is  only  rarely  necessary  to  have  recourse  to 
these  formulae  since  a  knowledge  of  the  general  properties  of  the  coefficients  will 
often  suffice  and  convenient  algebraic  tables  are  available  for  small  values  of  J. 


Symmetry  properties  of  the  Clebsch— Gordan  coefficients  and  the  Wigner  3-j  symbol 

j^,  7*2  and  j  form  a  triangle  Aij-^^j^j)-  The  Clebsch-Gordan  coefficients  have  a 
geometrical  significance  and  possess  symmetries  similar  to  the  spatial  rotational 
symmetries  of  the  system  concerned.  The  sjrmmetry  relations  are 


-=(-1) 
=  (-1) 


ii  +  r,-i 


C{J2hJ',  m^m^m) 


n-rrii 


2j+l 


2?2+l 


Cihjjz',  ^1'  -^.  -Wa)-         (16) 


The  maximum  symmetry  is  obtained  for  the  Wigner's  3-j  symbol  which  is 
defined  as 


h      ?2 


iyi-r.^m(^2j  ^r  ^)-'^-'C{jiJ2J;  mim^m). 


(17) 
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Even  permutation  of  the  columns  leaves  the  numerical  value  unchanged : 


'    h       ?2 

J 

"    ?2 

j  h 

?  h    h 

.  Wi  mg  - 

-m\ 

mg 

— m  m-^ 

—  mm^m^ 

(18) 


Odd  permutation  is  equivalent  to  multiplication  by  ( — 1)^1+^2^/; 


(-1) 


H'^')-i+) 


h      h  1 


m.  TYio  —m 


h    h       1 
nio  m-,  — m 


h        1    h 
m-,  —mmo 


1  h    h 


(19) 


Rotation  matrices 


To  rotate  the  axis  of  quantization,  we  apply  the  rotation  operator  9^,  to  the 
wave  function  ipf  and  obtain  a  new  wave  function  which  is  a  superposition  of 
different  eigenfunctions  \pf' 

wTi^i  ^')  =  2  ^H^,  P,  y)  wTi^i  ^)  (20) 

m'  mm' 

where  oc,  ^,  y  are  the  Eulerian  angles  of  the  rotation  i?.  The  angles  0,  99  go  over 
to  0',  9?'  in  the  new  system.  The  transformation  matrix  D  has  the  following 
properties  ^  ^,^^_, ^-^^^  ^^,^_ ,_g^)  ^  _D,^^_ ,_K^ ^^^ 


W/ 


BUCCOl-lZ-jj^lf-,-;-, 


rf*(;5a) 


Pf(cos^) 


Another  important  result  is  the  possibility  of  their  expansion  in  what  is  known 
as  the  Clebsch-Gordan  series 


or 


^1    nil 


j,/jii  +  fi2,mi  +  m-i 


(23) 


Note  that  in  the  above  we  have  dropped  the  third  magnetic  quantum  number  in 
writing  the  C  coefficient  since  the  relation  jjl  =^  fx^  +  [x^  automatically  fixes  its 
value. 


The  coupling  of  three  angular  momenta^  Bacah  coefficients  and  the  Wigner  6-j  symbol 

When  we  consider  the  coupling  of  three  or  more  angular  momenta,  more  than 
one  scheme  is  possible.  The  relations  between  different  schemes  lead  to  the  "re- 
coupling  coefficients"  otherwise  known  as  the  "Racah  coefficients". 
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The  addition  of  three  angular  momenta  J^,  Jj,  J3  can  be  effected  by  first 
forming  an  int-ermediate  state  corresponding  to  J^  =  Ji  +  J 2  and  then  combining 
J'  with  J3 .  Alternatively  the  intermediate  state  corresponding  to  J"  =  Ja  +  «^3 
can  be  formed  which  when  combined  with  that  corresponding  to  Jj  yields  the 
final  state.  The  two  final  states  so  obtained  \pjm(J*)  and  xpjm{J")  are  related  by 
a  unitary  transformation 

Wim{J')-ZIi^ri'V>im{J")-  (24) 

J 

The  Racah  coefficients  W  are  defined  by 

4.,..  =  [(2/"  +  1)(2,-'  +  I)]'/.  W(hMh\  m-  (25) 

Therefore  the  Racah  coefficients  relate  the  two  coupling  schemes.  It  should  be  noted 
that  the  phases  can  be  so  chosen  that  they  are  real.  Using  (abed;  ef)  for  [j^  J2JJ3 1  ?'  ?") 
and  ocPyd  for  the  four  projections  of  the  angular  momenta  a,b,c  and  d  on  the 
2  axis  and  noting  that  y  =  a  +  P  -\-  d  the  following  relations  can  be  derived : 

2;[{2e  -f  1)(2/  +  !)]'/«  W{abcd\ef)C{hdf;  ^d)C(afc;  oc,  ^  +  6) 
f 

=  C(abe;(xP)C{edc;oc  +  ^,d)  (26) 

[(26+  1)(2/+  l)]'l^  W(abcd;  f)C(afc;oc,p  +  6) 

=  Z  C{abd\(x^)  C{edc,oc  +  p,  d)  C(bdf ;  ^6)  (27) 

[(2e+  1)(2/+  l)]'l*W{abcd;ef) 

=  SS  C{abe;  ocp)C{edc;oc  +  p,  d)C{bdf;  ^d)  C{afc\  oc^  +  S).        (28) 

W{abcd\ef)  =  0  miless  the  triangular  conditions  A{abe),  A(edc),  A(bdf)  and 
A{afc)  are  satisfied.  The  Wigner  Q-j  symbol  can  be  defined  by 


L  h  i  f 


=  {-iy^''^'^'^'W{j,J2Jh;  j'j").  (29) 


A  cyclic  permutation  leaves  the  symbol  invariant;  so  does  the  interchange  of 
lower  and  upper  arguments  of  any  two  columns.  There  are  twenty-four  such 
operations  which  leave  the  sjmabol  invariant,  which  are  isomorphic  with  the 
symmetry  group  of  a  regular  tetrahedron  with  sides  jij^jzj' j"  and  j.  We 
also  have  the  following  relations  for  the  permutation  of  the  arguments  of  the 
Racah  coefficient. 

W{abcd',  ef)=  W(badc;  ef)=  W{cdab;  ef)=  W{acbd;  fe)  (30) 

Wiabcd;  ef)  =  {-If^^-^-^  W{ebct',  ad)  =  {-If^^-^-' W{aef  d\  be)        (31) 

2]{2e+l){2f+l)  W{abcd;  ef)  W{hbcd;  eg)  =  df,  (32) 

e 

^r(a.c<^;e/)^^;-y;;-;;;>;;;,,.  (33) 


APPENDICES 


505 


The  9-j  symbol 

Wigner  has  considered  the  two  modes  of  composing  four  angular  momenta 
and  established  the  connection  between  these  two  coupling  schemes  as  the 
relation  between  the  L-S  and  J- J  couplings. 


<«  b  (c)  a'  h'  {c')  d  \a  a'  (e)  b  b'  (/)  dy 

=  [{2e  +  1)  (2/  +  1)  (2c  +  1)  (2c'  +  1)]V^ 


aa  e 
bb'  f 
c  c'  d 


(34) 


This  is  invariant  under  even  permutation  of  columns  and  rows  and  under  trans- 
position while  an  odd  permutation  produces  a  change  of  sign 

/ j\«  +  a'  +  e  +  6  +  6'  +  /  +  c  +  c'  +  (Z 

Interchange  of  the  rows  with  the  columns  of  the  3x3  matrix  leaves  the  9-/ 
symbol  unchanged. 

Numerical  values  for  some  of  the  9-;  symbols  have  been  tabulated  by  Sharp 
et  al. 

'  aa'  e 

bb'  f 


[(2c+l)(2c'  +  l)(2c+l)(2/+l)J 


c  c'  d 


=  Z  C{abc\  ocl^y)C{a'b'c';  oc' ^' y'){-lf-''' 

yy'eq)C{aa'e;  oc  -  oi' s)  (-  1)*'-^'  C{b  b' f\  ^  -  ^'  cp){-  lf->" 
C{cc'd',  y-y'd)C{efd;  scpd) 
=  [(2e+  l)(2c'+  1)(2/+  l)(2c+  l)pi;(2A:+  1)  W{aa'df',  ck) 
W{bfc'  a';  b' k)  W{abdc';  ck).  (35) 


3.  ALGEBRA  OF  THE  y-MATRICES^ 

The  y-matrices  occuring  in  the  Dirac   equation   are   defined   through   their 
irreducibility  and  by  the  anticommutation  rules 

{Y„y.}  =  2d^,  (1) 

where 

d^^  =  0       for  jn  ^v 

=  +lfor/^  =  r  =  4 
=  — 1  for  jLi^v=  1,2,3. 
Following  are  the  important  properties  of  the  y-matrices : 


^  R.  P.  Feynman,  Lectures  delivered  at  the  California  Institute  of  Technology  (1953). 
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(1)  If  y^  and  y,'  are  two  irreducible  sets  of  matrices  satisfying  the  above  rule 
(1)  then  they  are  related  to  each  other  by 

y',-=Sy^S-^  (2) 

where  the  matrix  S  is  unique  except  for  an  arbitrary  multiplicative  factor. 

(2)  By  taking  linear  combinations  of  these  y-matrices  we  can  form  sixteen 
linearly  independent  matrices  which  are 

A  --=  1 ;  A  =  y,{fJ^  =  1,  2,  3,  4);  Ts  =  y„yAf^  +  v) 

i\  =  y^y/^  A  =  n  =  71727374-  (3) 

(3)  The  maximum  number  of  linearly  independent  matrices  of  order  n  is  n^. 
Hence  from  (2)  we  find  that  in  this  case  w  ^  4.  We  can  now  give  an  explicit 
represent-ation  of  the  y -matrices  of  order  4 

i=l,  2,  3;     yt=^^      ^ 


0-1 


(4) 


where  tr,-  are  the  2x2  Pauli  matrices.  It  follows  that 

yf  =-l;y?=  1  and  yf  =  -1 
also 

{75,7.} --0.  (5) 

(4)  K  a  is  a  four  vector  we  can  define  the  operator  a  by 

a  =  a^yt  —  a^y^  —  a^y^  —  a,y, .  (6) 

From  this  it  can  be  shown  that 

ab  +  ba  =  2a.6  (7) 


where 


a  -6  -  a^h„. 


The  following  results  can  be  demonstrated : 

{75,  ci}.  =  0 

7xa7a:  =  a  +  2a^7^;     yf,y,,--^;     y^ay^=-2a 

y^aby^  =  4a'b;     y^abcy^^ -2cba.  (8) 

4.  THE  DENSITY  OF  STATES^ 

We  are  interested  in  computing  the  number  of  states  Q(E\p^p2  .  .  .  p^)  available 
to  a  system  of  n  particles  when  its  total  energy  lies  between  E  and  E  -\-  dE  and 
the  particles  have  momenta  Pi,  P2,  •  •  . ,  p^^ .  To  do  this  it  is  convenient  to  compute 
Q{E;p)  and  q{E;  P1P2)  the  density  of  states  for  one  and  two  particle  systems 
respectively.  The  generalization  to  the  n  particle  system  follows  immediately. 


^  Alladi  Ramakrishan  Handhuch  der  Physik,  III,  524,  Springer- Verlag  (1959). 


k  +  dk  is,  hy  the  fundamental  assumption  of  quantum  mechanics -^^  or  ^^  ^.3 
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Tlie  computation  is  based  on  the  postulate  that  the  number  of  states  available 
to  a  single  particle  is  uniformly  distributed  in  the  six-dimensional  phase  space. 

One  particle  system 

The  number  of  states  available  for  a  particle  of  momentum  between  k  and 

d^k         d^k 

if  ^  ==  1.  The  number  of  states  available  when  |  k  \  lies  between  p  and  p  +  dp  is 

p^dpl{27tf.  (1) 

Hence  the  density  of  states  per  unit  energy  interval  ^  is 

,T.  ^      (    P^    \      i  ^P\        Ep      .         dp        E 

Two  particle  system 

If  p^  and  ^2  ^^^  the  momenta  and  E^  and  ^2  ^^^  energies  of  the  two  particles 
the  total  momentum  and  energy  are  given  by 

P  =  Pi  +  P2 

E  =  E^  +  E^.  (3) 

The  number  of  states  available  in  the  energy  interval  d^  is  just  the  number  of 
states  available  to  a  particle  of  momentum  between  p^  and  Pi  +  dp^  (the  other 
momentum  |>2  being  fixed)  i.e. 

where  d^^  has  to  be  computed  corresponding  to  a  given  d^.  Then  q  is  obtained  as 

Three  particle  system 

In  the  case  of  a  three  particle  system  we  ask  for  the  density  of  final  states  per 
unit  interval  of  total  energy  E  and  per  unit  interval  of  the  energy  of  the  particle 
Avhen  the  two  other  particles  have  momenta  p^  and  pg  respectively.  The 
momentum  P3  of  the  third  particle  is  determined  by  the  momenta  of  the  other 
two  particles.  The  density  of  states  is  obtained  by  multiplying  the  density  of 
states  ^(^;  pi)  corresponding  to  particle  1  by  the  density  of  states  q{E— E^\ 
p  —  p^)  of  the  two  particle  system.  Thus  we  have 


1  In  literature  the  term  "density  of  states"  is  used  rather  loosely,  either  to  denote  the  num- 
ber of  states  in  an  infinitesimal  interval  d^  or  the  number  per  unit  energy  range.  We  how- 
ever wish  to  keep  this  distinction  clear.  To  obtain  the  number  from  the  density  we  multiply 

V 
by  the  factor  -p  dp. 
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When  one  of  the  particles  has  infinite  mass  (and  hence  infinite  energy)  we  have,  on 
setting  ^3  =  wig  =  00 

?(^;  PiPiPz)  -  j2^  ^iPiPi^Pi'  (7) 

The  density  of  states  of  a  7i  particle  system  can  be  easily  obtained  by  considering 
it  to  be  composed  of  two  systems,  one  containing  a  particle  of  energy  and  momentum 
El  and  p^  and  the  other  of  (»  —  1 )  particles  of  total  energy  and  momentum  E  —  Ej 
and  />  —  Pi,  and  applying  the  above  rule  for  composition  successively. 


5.  RELATIVISTIC  TRANSFORMATIONS 

We  here  summarize  some  of  the  standard  results  relating  to  relativistic  transfor- 
mations. These  have  been  used  in  the  main  part  of  the  book  in  two  contexts : 

(1)  The  first  occurs  in  the  theory  of  elementary  particles  where  it  is  assumed 
that  the  laws  of  physics  are  independent  of  the  system  of  reference  in  which  the 
phenomena  are  observed.  The  physical  laws  are  expressed  as  equations  con- 
necting physical  quantities  and  their  variations  with  coordinates.  By  invariance 
under  transformations  is  meant  the  invariance  of  such  equations  and  relationships 
and  not  ofthe  physical  quantities  themselves.  The  systems  of  reference  envisaged 
here  are  those  which  move  with  uniform  velocity  relative  to  one  another.^ 

(2)  The  second  context  in  which  we  use  relativistic  transformations  is  in  the 
study  of  physical  quantities  in  different  coordinate  systems  for  ease  in  compu- 
tation. Such  cases  arise  in  collision  processes  involving  a  finite  number  of  particles. 

Let  the  relative  spatial  coordinates  of  two  events  be  denoted  by  Xj^,  X2,  x^  and 
the  temporal  coordinate  by  x^  {x^^  =  t)  in  the  coordinate  system  S  and  let  their 
corresponding  values  in  the  system  aS"  moving  with  a  Uniform  velocity  v  along 
the  X-axis  of  S  he  x{,  x'^,  x'^  and  x'^ .  These  quantities  are  then  related  by  the 
equations  ^.  _  ^  (^^  _  ^,^^  ^y(x,-  ^x,) 

x^  =  x.^ 

x[^y{t—vxi)^^y{x^  —  ^xi)  (1) 

with  /5  -  V  and  7  =  (1  —  P^)-"f\ 

If  the  relative  velocity  is  in  any  direction,  i.e.  has  components  v^,  v^ ,  v^,  in  the  x,  y,  z 
directions  respectively  we  have  merely  to  write  v^  instead  of  1;  in  the  transformation 
formula  for  x^.  The  other  two  coordinates  have  similar  formulae.  Any  quantity 
whose  components  transform  according  to  the  above  law  is  called  a  four- vector. 
Thus  x^,  x^,  x^  and  ic^  form  the  components  of  a  four- vector.  The  scalar  product 
of  two   four- vectors  a  and   h  is  defined  as  a  •  b  ==  a^b^  —  a^b^  —  a2b2  —  ctsb^. 


1  W.  Pauli,  Theory  of  Relativity,  Pergamon  Press  (1958). 
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This  is  invariant  under  the  above  transformation.  In  particular  the  invariant 
ol  —  af  —  al  —  ^i  is  called  the  square  of  the  "length"  of  the  four- vector  a. 
More  generally  if  we  define  L  as  a  4  x  4  matrix  which  transforms  x  to  x\ 

x'^Ax  is  invariant. 


i.e.  x'  =  Lx  such  that  the  quadratic  form  x^ 

then  the  relation  L"^ AL  =  A  holds.  Here  A  is  the  matrix 


x\ 


X2 


^2 

^3 


1 

0 

0  on 

0 

-1 

0  0 

0 

0 

-1  0 

0 

0 

0  i_ 

and  X  is  the  vector 


(2) 


^4  J  • 

If  we  had  taken  the  fourth  dimension  as  it,  then  L  corresponds  to  a  rotation  in 
four-dimensional  space.  The  transformation  given  in  (1)  is  a  particular  case  of  L 

^^^                                            ^      y    0    0  -py- 

0     10  0 

0     0    1  0 

_-^y  0    0  y  _ 

We  now  give  below  some  examples  of  four- vectors : 
(1)  X,  y,z,t  the  space  and  time  coordinates  of  an  event. 

.2)    -A  _J L  A_ 

^  '  dx'       dy'        dz'    dt 

fOS  ^^  ^^  ^z  . L . 

^  '      V(l  -  u^) '  ]/{!  -  u^) '  J/(1  -  u^) '  j/(l  -  u^) 

where  %,  Uy,  %  ^^^  the  components  of  the  velocity  of  the  particle. 


(4) 


F. 


dE 


1 


y{l~u^)'  ]/{l-u^)'  ]/{l-u^)'    dt    1/(1-1*2) 
where  F  is  the  force  on  the  particle  and  E  its  energy. 

(5)  Px,  Py,  Pz,  E  the  momentum  and  energy  of  a  particle. 

(6)  k^,hy,  kz ,  V  where  k  is  the  propagation  vector  of  a  wave  and  v  is  its  frequency. 
C^)  jx^  jy>  ?2'  Q  *h^  current  and  charge  densities. 

(8)  A^,  Ay,  A~,  (p  the  vector  and  scalar  potentials  of  an  electromagnetic  field. 

For  the  sake  of  completeness  we  give  also  the  transformations  of  some  closely 
related  physical  quantities  which  however  are  not  four- vectors. 
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( 1 )  The  components  of  the  velocity  u  of  a  particle  transform  as 

(2)  We  shall  give  the  transformation  law  for  acceleration  n  under  the  limited 
condition  u  0,  i.e.  we  find  the  acceleration  in  the  momentary  rest  frame  of  the 
particle.  We  then  have  ^^  ^.  ^^^j  _  ^^y,, 

Uy==lL,(l-U^)  (4) 

Uz  =  Uz{l  —U^). 

(3)  If  6  is  the  angle  made  by  the  velocity  u  of  a  particle  moving  in  the  x^  —  .rg 
plane  in  S  with  the  positive  direction  of  the  or^-axis,  the  corresponding  angle 
(9' in  5' is  given  by  ^^-^q 

tan  d'  =  — ^ (5) 

y{u  cos  6  —  v) 

tan  -9  =    -  - , ^, .  (6) 

y{u  cos  d  -\-  V)  ^ 

In  the  non-relativistic  case,  y  =  1  and  usind  =  u'  sin  0',  i.e.  the  velocity 
perpendicular  to  the  relative  velocity  of  the  frames  is  unaffected.  Even  in  the 
relativistic  case,  it  is  not  affected  very  much  if  y  -^  1. 

There  is  one  interesting  result  which  can  be  obtained  as  a  corollary  to  the 
above  transformations  and  which  has  applications  in  collision  problems.  For 
a  given  u'  in  &'  and  0  in  aS  there  are  two  solutions  b\  and  02  possible  for  b'  as 
given  by  the  equation 

^2  +  1  ^    ^ 

V 

where  (x  =  y  tan  d  and  /j,  =  —y.  This  equation  is  obtained  by  solving  equation  (5) 

u 

for  cos  6'.  This  implies  from  (7)  that  there  are  two  values  u^  and  i^g  for  the  velo- 
city in  the  S  frame  corresponding  to  these  two  values  of  0'  in  the  S^  frame.  This 
can  be  easily  demonstrated  geometrically  in  the  non-relativistic  case.  The  velocity 
u  in  S  is  just  the  third  side  of  the  triangle  which  has  to  be  determined  given  the 
two  sides  i;  and  u^  and  the  angle  6  between  v  and  u.  Two  triangles  are  possible 
if  i*'  <  i;;  otherwise,  only  one  exists.  These  cases  are  shown  in  figure  (69).  The 
arguments  are  similar  in  the  relativistic  case  jn  being  double  valued  if  /^  >  1  and 
single  valued  if  /^  <  1 . 

Finally  we  notice  that  if  F' {d' ,  E')diQ'dE'  denotes  the  number  of  particles 
emitted  in  the  S'  system  between  the  solid  angles  Q'  and  i3'  +  di2'  and  the  energies 
E'  and  E'  +  dE\  the  distribution  of  the  particles  in  the  S  system  is  given  by 

F{d,  E)dicose)dE  =  F'[d'{e,  E),  E'(d,  E)]Jd{cosd)dE  (8) 

"^^^^  ,     eicose',E') 


a  (cos  (9,  E) 


(9) 
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If  we  wish  to  find  the  angular  distribution  i^(0)d  (cos  6)  in  the  S  system  irrespective 
of  the  energy  of  the  new  particle,  we  have 


F{d)d{GOsd)  ^271  JF[d'{d,  E),  E'[d,  ^)]  Jd(cos0) 


(10) 


where  now 


a(cos(9' 


a(cos0) 


E' 


Fig.  69.  Geometrical  representation  of  ambiguity  in  u  given  u'  {u'  <u). 

Application  to  collision  problems 

In  collision  problems  it  is  often  found  convenient  to  describe  the  phenomena 
in  the  centre  of  mass  system  S'  of  the  particles  and  finally  transform  back  to  the 
laboratory  system  S.  In  order  to  carry  out  these  transformations,  we  have  merely 
to  compute  the  velocity  of  the  centre  of  mass  relative  to  the  laboratory  system. 
The  computation  of  this  velocity  is  done  by  a  consideration  of  suitable  invariant 
products  of  four- vectors  in  the  two  systems. 


Two  particle  collisions 


Laboratory  system 


p[^K 


-P2=-Pi'    ^2- 


Centre  of  mass  system 


/>.  E| 


Pz^O,  E, 


p'r  e; 


CM. 


P2'-P'r^2 


Fig.  70.  Energies  and  moments  in  the  laboratory  and  centre  of  mass  systems. 
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Let  US  consider  the  collision  of  two  particles  of  rest  masses  m^  aiyl  m^  the  sub- 
script 2  denoting  the  particle  which  is  at  rest  in  the  S  system.  Let  p^ ,  E^  and 
p^[  =  0),  E^{=  rwg)  be  their  corresponding  momenta  and  energies  and  \etp(=  p^) 
and  E{=  E^  +  m^)  denote  the  total  momentum  and  energy  of  the  system.  The 
corresponding  quantities  in  the  /S'  system  are  denoted  by  primes.  The  velocity 
of  *S'  relative  to  S  is  denoted  by  p.  The  total  energy- momentum  four- vector 
in  the  two  systems  are  then  given  by 

*Sf :  pi  0  0  ^1  +  mj 

,Sf':0  0  0^'.  (11) 

Noting  that  the  coordinate  system  is  so  chosen  that  p^  is  along  the  x-axis,  it 

follows  that 

Oil  y[pi-piE,  +  m^)]  .    (12) 

or 


/*= 


Pi 


Ei  +  m^' 


(13) 


Knowing  /5  all  the  other  quantities  can  be  computed  in  the  S^  system  using  the 
appropriate  transformation  formulae. 

Decay  of  a  single  particle  into  three  particles 

Let  us  consider  a  particle  which  decays  at  rest  into  three  particles  of  equal 
mass  all  moving  in  the  same  plane.  Taking  the  laboratory  system  as  the  one  in 
which  the  original  particle  is  at  rest,  the  configuration  of  decay  can  be  specified 
in  terms  of  the  momentum  p^  of  the  decay  particle  1  and  the  momentum  q  of 
one  of  the  other  two  decay  particles  2  and  3  in  their  centre  of  mass  system.  In 
the  laboratory  the  individual  momenta  of  the  three  particles  are  p^ ,  p2  and  p^ 
respectively.  The  total  momentum  is  zero  and  the  total  energy  18  M  =  w^-i-  W2  +  w^ 
where  w^ ,  w^  and  w^  denote  the  energies  of  the  particles.  Now  let  p'  be  the  momentum 
of  particle  1  in  the  2-3  rest  system  and  w'^  the  corresponding  energy.  The  velocity  v 
of  the  2-3  system  with  respect  to  the  laboratory  system  is  given  by 

P 

taking  the  plane  of  decay  to  be  the  xy-]pla,ne  and  the  rr-axis  in  the  direction  of  pj . 
The  energy-momentum  four- vectors  of  the  individual  particles  and  the  whole 
system  in  the  laboratory  and  the  centre  of  mass  frames  of  reference  can  be 
written  as  follows : 


Laboratory  System 

2-3  centre 
of  mass  system 

Four- 
Vector 

Particle  1 

Pi               0          0    Wi 

Pi'    , 

0,0,           w[ 

{A) 

Particle  2 

P2  cos  a      pisin  tx      0    W2 

qcoBd    , 

q9>m.d   ,    0,           w' 

(B) 

Particle  3 

793  cos /3    -pgsin^    0    W3 

—  q  cos  0 , 

-qsinB,    0,            w' 

(C) 

The  total  system 

0               0           0    M 

Vr       > 

0       ,    0,    {2w'  +  w[) 

(D) 
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To  determine  q,  we  make  use  of  the  fact  that  the  scalar  product  of  A  and  D 
is  invariant.  Therefore 


But 


Hence 


Mwi={2w'  +  w[)  w[  -'p^  =  2 w'  w'^  +  w} 


w 


;=(--«^'.)/1'-(m^.t)' 


^  =— 2^.;^—=- 2- 


^  = 


{M^-3m^-2MwiYI> 


To  describe  the  process  fully  it  is  then  enough  to  specify  co^  =  m  -\-  t  [t  being  the 
kinetic  energy  of  particle  1)  and  the  angle  6  between  the  vectors  p  and  q.  6  can 
be  calculated  from  the  observed  momenta  in  the  following  way.  Taking  the 
scalar  product  of  B  and  D 


But 


Mw2=(2w'  +  w^)  w'  —  p'  q  cos  6 . 


^'  =  (^^  +  irr^^^) 


pi 


(M-w^) 


Mp^ 


{M^  +  m^-2M w^yi-  ' 


Substituting  the  values  of  p'  and  w'^ 

in' 

=  2w'^  + 
Similarly  from  C  and  D,  we  get 


,  2       w'  {M  w^  —  m^)  —  M  pq  cos  d 
"^         ( Jf  2  +  m2  -  2  J/  w^yi^ 

w'  {M  w^  —  m^)  —  M pq  cos  d 


Mw.  =  2w'^-\- 


,2  ^    w' {M  Wi  —  m^)  -}-  Mpq cos d 


2w' 


M{w^-w^  = 


i.e.  cos0  = 


2M pq  cos 6 

2^'        ' 


W.i~  Wo    W 


BPOR      33 
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COS 6  can  be  expressed  also  in  terms  of  the  measured  momenta  Pi,  P2,  Ps  in  the 

laboratory  system  as 

^      wl-vi  {M^  +  m^  -  2  M  w^yi* 

cos  u  = 

w^  +  w^  2p^q 


^Pi-Pl\.      (      Pi       \T' 
2pq     [        \M-wJ  \    ' 


6.  INVARIANT  FUNCTIONS^ 

We  shall  now  summarize  the  derivation  of  the  various  invariant  functions 
associated  with  free  particle  propagators  which  are  expressed  in  terms  of  the 
wave  functions.  It  will  be  convenient  to  define  these  functions  first  as  a  sum 
over  the  three  dimensional  momentum  p  and  later  express  them  as  integrals. 
Let  ip  be  the  wave  function  with  one  or  more  components  and  let  us  associate 
>vith  it  the  propagator  G{2,  1)  given  by 

G(2,  1)=2'Vp(2)Vp(1).  (1) 

P 

where  (1)  and  (2)  denote  space-time  points.  If  we  assume  the  relativistic  relation 
E^  =  p^  -\-  m^  between  the  energy  E  and  momentum  p,  E  has  two  roots  given 
by  ±  Ej,  where  Ep  =  +  ^p^  +  m^.  Then  6? (2,  1)  can  be  written  as 

0{2,  I)  =  2J  ¥p'H2)w^p'^Hl)  +  2  V^^-H2)y>^f^Hl) 
p  p 

=  G,{2,l)^G_{2,l),  (2) 

where  G+  and  (t_  are  the  sums  corresponding  to  the  positive  and  negative  values 
of  the  energy. 

We  shall  define  two  associated  propagators  G'  and  Gq,  for  reasons  which  will 
be  apparent  presently,  as 

G'{2,l)=G^{2,l)-G_{2,l)  (3) 

and  Go{2,  I)  =  G{2,  1)  for     t^  >  t^ 

=  0  for     ^2<^i-  W 

From  these  we  can  define 

«,,2.  1)  =  0,(2.  1)  -  0.(2.  1,  =  ^'(2.1)  +  ^(;.-^)^(2.J),  (5) 

where  e  (^g  —  '1)  =  ±1  depending  on  ^2  —  h^  ^^ 
We  now  express  these  functions  as  integrals 

«.  (2, 1)  =/ ai;  -j^  Vp'*(2)  v'p''W  d> 


^  G.  ELallen,  Handhuch  der  Phyaik,  Vol.  6,  Part  I,  Springer- Verlag  (1958). 
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where  xp^^^  denotes  the  wave  function  with  positive  values  of  the  energy  and 
y)p  the  wave  function  corresponding  to  the  four  momentum  p.  Using  the  contour 
integral  representation  for  the  ^-function,  we  write 


^^^'''^=f   IyeI-^-W-^ 


y>p(2)flw 


-  (2^rJ''P  J     2K      P.-E„       "^^^ 


d> 


(7) 


0+<«f) 


where  p^  is  treated  as  a  complex  variable,  and  the  contour  C+  (E^)  is  an  anticlock- 
wise (+)  contour  round  E^  (see  Fig.  71).  Similarly  G_  is  given  by 

ft 


j      2Ej,  2ni  {2nf      -    ■    ^  ^ 


"(2" 


nf     J 


0'      V^  +  E^ 
1      V^^(2)V';(1) 


2^^      V,  +  E, 


^'V 


C+{-Ep) 

where  C+{ — Ep)  is  the  anticlockwise  contour  round  — Ep  (see  Fig.  71). 


(8) 


(9) 


Fig.  71.  The  contour  C 


33' 
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Noting  that 


2.    .^       .         „    ,    ..       ^.      ^  (10) 


^{_J _l_l  =  __L_ 

ictions  O,  O*  and  0^  as 
over  suitably  defined  contours.  Thus 


we  can  MTite  the  functions  Oy  O*  and  Oj>  as  integrals  of  the  same  function 

Vp(2)y;(i) 


c 
where 

=  C^(Ej,)-\-C.{-Ej,),  (12) 

where  C+  and  C_  denote  anticlockwise  and  clockwise  contours  respectively.  Thus  C 
is  represented  by : 


Fig.  72.  The  contour  C. 


Similarly 


where 


wpmvlw 


-".-"-w/^fiS^*''  «■»> 


C"  =  C^  {E^)  +  C+  (-^^)  (see  Fig.  71)  (14) 

and  Gp  =  G+         for     t^  >  ^i 

=  —  Gf_     for     ^2  <  ^1  (15) 

that  is,  the  function  -^-^ — ^^ —  has  to  be  integrated  over  the  contour  C+  (E^) 
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if  ^2  >  ^1  and  over  (7_  (—E^)  if  ^2  <  ^i-  This  is  achieved  by  taking  the  contour  C^ 
as  shown  in  Fig.  73. 

This  contour  can  be  closed  downwards  when  t^  >  t^  since  p^^  will  then  have  a 
large  negative  imaginary  part  and  hence  e~*^*^'*~*i^  is  of  the  form  e"'*^*""'^^  and 


P4=  +  Ep 


Fig.  73.  The  contour  Cp. 


Fig.  74.  The  contour  -  Cp  •  {x^  +  0). 


as  oc  tends  to  00 ,  the  integrand  tends  to  zero  and  the  closure  can  be  made  with- 
out any  contribution  to  the  integral.  The  contour  being  anti  clockwise,  this  gives 
6^+ (2,  1).  In  a  similar  manner,  for  ^2  <  ^u  the  closure  can  be  made  in  the  upper 
half  plane  (see  Fig.  75).  The  contour  being  anticlockwise,  we  now  obtain  — 0_  (2,  1). 
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The  same  result  can  be  obtained  by  taking  a  contour  along  the  real  axis  from 
+  00  to  — 00  and  assuming  that  the  singularities  occur  at  Ep  —  ie  and  — (Ep  —  ie) 
and  making  c  ->  0. 


If  we  integrate  the  function 


v>(2)y;(i) 


Ei 


over  the  contour  which  is  a  line 


darallel  to  the  real  axis  and  above  the  singularities  ±  Ep  we  obtain  by  the  above 
arguments  0^=0^  +  0.     if    ^,  >  ^ 

=  0  if    <a<^i.  (16) 


Fig.  75.  The  contour  C^.  •  {x^ 


On  the  other  hand,  integration  over  a  contour  which  is  a  line  parallel  to  the 
real  axis  and  below  the  singularities  ±  Ep  yields 

=  -(G^  +  G_)    if    t^<t^.  (17) 

Writing  e~'*"^  for  the  wave  function  y)  we  identify  the  G  functions  with  the 
known  invariant  functions  as 


0,  =  iA,; 
—  Ar 


G.^iA. 


G 


GR=-iA, 


(18) 

(19) 

(20) 
(21) 


On  writing  the  spinors  for  y),  we  identify  Gp  as  the  Feynman  propagator  Kp. 
It  is  also  customary  to  define  a  function  Sp  as 

8p=-2Kp  (22) 

such  that  Sf=  {p  +  m)Ap.  (23) 
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7.  QUANTIZATION  OF  THE  ELECTROMAGNETIC  FIELD^ 

When  we  try  to  quantize  the  electromagnetic  field,  i.e.  to  obtain  an  expression 

for  Afj^  [x)  corresponding  to  field  operators  we  immediately  run  into  the  difficulty 

d  A  (x) 
that  the  Lorentz  condition  — -—- —  =  0,  which  is  so  essential  in  the  classical 

dx^ 

theory  for  establishing  that  the  equation  for  the  potentials  n]A^^(x)  =  0  cor- 
responds to  the  Maxwell  equations,  cannot  be  taken  over  into  the  quantized 
theory  as  an  operator  identity.  For,  this  would  contradict  the  commutation 
relations  since 


'^"^"^  ,  AA^'] 


dx^ 


£"'-"*»■ 


If  on  the  other  hand,  we  ignore  the  Lorentz  condition,  we  are  led  to  the  result  that 
the  energy  is  not  positive  definite,  which  is  untenable.  The  two  difficulties  are 
connected  with  each  other  and  when  the  Lorentz  condition  is  properly  incorporated 
in  the  quantized  theory,  the  states  of  negative  energy  will  not  appear  as  we  shall 
see  presently. 

Fermi  ^  tried  to  get  over  the  difficulty  in  imposing  the  Lorentz  condition  as  an 
operator  equation  by  observing  that  to  get  the  classical  electromagnetic  theory 
as  a  limiting  case,  it  is  not  necessary  that  all  the  Maxwell  equations  should 
correspond  to  operator  identities  in  the  quantized  theory,  but  that  it  is  sufficient 
if  the  expectation  values  of  the  field  operators  satisfy  these  equations.  He  there- 
fore took  the  equation  ^  . 

^k>==0  (2) 

as  the  "auxiHary"  condition  in  the  quantized  theory.  Using  the  expansion  of 
Ai^(x)  in  terms  of  the  creation  and  annihilation  operators  of  the  photon  correspond- 
ing to  the  four  degrees  of  freedom,  this  can  be  written  as 

[a<3)(fe)  +  ^a(4>(fe)]  |v^>  =  0  (3a) 

[a*(3)(fe)  +  ia^^''\k)]  Iv^)  -  0,  (3b) 

where  a^^^  and  a*^^^  are  the  annihilation  and  creation  operators  respectively 
of  the  longitudinal  photon  and  a^^^  and  a*^^^  are  the  creation  and  annihilation 
operators  respectively  of  the  scalar  photon.  (The  change  in  the  role  of  the  operators 
in  the  case  of  the  scalar  photon  is  connected  with  the  fact  that  the  fourth  component 
of  A^{x)  is  an  imaginary  quantity  while  the  other  components  are  real.)^  We  see 
that  the  auxiliary  condition  affects  only  the  longitudinal  and  scalar  photons 
and  leaves  the  transversal  degrees  of  freedom  uninfluenced.  A  possible  state  vector 


^  In  this  section  the  fourth  component  of  four  vectors  is  i  times  the  corresponding  quan- 
tities in  the  previous  chapters.  Thus  A^^  here  corresponds  to  iA^  of  the  earher  chapters. 

2  E.  Fermi,  Rev.  Mod.  Phys.  4,  87  (1932). 

*  See,  e.g.  G.  Kallen,  "Quanten  Electrodynamik"  in  Handhuch  der  Physik,  Vol.  V 
Part  I,  p.  187,  Springer-Verlag  (1958). 
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which  satisfies  the  auxiliary  condition  can  be  written  as  a  product 

lv>=  \y>T>n\^{k)>  (4) 

k 

where  |yr>  contains  only  the  transversal  photons  and 

«<»)»!(•)  «(»)n(*)       yn'^^'ln^*'] 

where  nS^^  and  vS*^  are  the  number  of  longitudinal  and  scalar  photons  respectively, 
present  in  the  state.  Substitution  in  (3)  gives  a  pair  of  equations  for  the  consist- 
ency of  which  we  must  have 

anC).n(«)  =  C'^n(.).  „(«)(- i)«^^  (6) 

where  the  constant  C  is  to  be  determined  by  the  normalization  condition 

<0„|0„>=1.  (7) 

Fermi*s  method  thus  gives  a  precise  prescription  for  the  mixing  of  the  longi- 
tudinal and  scalar  photons  which  can  appear  in  a  physically  realizable  state. 
These  degrees  of  freedom  are  thus  eliminated  and  only  the  transversal  photons 
are  allowed  to  exist,  their  energy  being  equal  to  the  total  energy  of  the  electro- 
magnetic field.  (The  energy  of  the  longitudinal  photons  cancels  exactly  with  that 
of  the  scalar  photons.)  Thus  the  total  energy  of  a  state  is  always  positive  definite. 
Though  the  above  method  of  inclusion  of  the  Lorentz  condition  looks  elegant 
at  first  sight  there  are  mathematical  difficulties  associated  with  it.  Thus  for 
instance,  it  looks  as  if  the  auxiliary  condition  (2)  is  in  contradiction  with  the 
commutation  rules,  since 


<. 


dAJx) 


0x^ 


.  A,(^')] 


'>-'i"'-"+». 


instead  of  being  zero  as  we  should  expect  from  (2).  But  this  is  due  to  the  fact 
that  the  vector  j0;t>  is  not  normalizable  with  a  finite  C.  Thus  (2)  would  lead  to 
an  expression  of  the  form  0  •  oo  which  must  be  first  defined  by  a  limiting  proce- 
dure. It  is  found  that  this  can  be  done  only  at  the  cost  of  choosing  infinite  gauge 
functions.^ 

The  difficulties  connected  with  the  unnormalizable  state  vectors  arise  from 
the  fact  that  in  (3),  both  creation  and  annihilation  operators  are  present  which 
leads  to  the  solution  (4)  containing  state  vectors  with  an  infinite  number  of 
particles  which  cannot  be  always  explicitly  normalized.  Further  the  conditions  (3) 
would  mean  not  only  that  longitudinal  and  scalar  photons  are  not  present  but 
also  that  such  photons  cannot  be  emitted  which  latter  condition  is  too  stringent. 
Gupta  and  Bleuler^  proposed  that  the  auxiliary  condition  (3  a)  should  be  modified 


1  F.  CoESTER,  Phys.  Rev.  83,  798  (1951) ;  also  G.  Kallen,  loc.  cit.  p.  196  and  the  following. 

2  S.  N.  Gupta,  Proc.  Phys.  Soc.  Lond.  A  63,  681  (1950);  64,  850  (1951);  K.  Bleuler,  Helv. 
Phys.  Acta  23,  567  (1950). 
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such  that  only  annihilation  operators  are  present  in  it.  But  this  cannot  be  done 
by  merely  dropping  equation  (3  b),  since  (3  a)  itself  contains  both  creation  and 
annihilation  operators.  Thus  we  must  choose  the  following  representation  for 

<^  +  l|a*(4)|w>  =  <^|a(4)|^  +  l>  =  1/^  +  1  (9) 

thus  bringing  the  roles  of  the  creation  and  annihilation  operators  for  the  scalar 
photons  also  in  line  with  those  for  the  other  components  of  the  electromagnetic 
field.  But  this  would  violate  the  reality  condition  of  the  Af^{x),  the  fourth  com- 
ponent also  being  real. 

To  obviate  this,  Gupta  introduced  the  indefinite  metric  operator  rj  originally 
used  by  Dirac-"^  in  a  different  connection  which  has  the  following  properties: 
The  norm  of  a  state  vector  is  defined  as 

Norm     I  ^>  =  '(y)  |  ^  |  'V^>  •  (10) 

To  ensure  the  reality  of  the  norm,  rj  must  be  hermitian.  Similarly  the  expectation 
value  of  an  operator  F  is  defined  by 

F^  <y)\r]F\y)y.  (11) 

With  these  new  definitions  the  norm  of  a  state  vector  can  also  be  0  or  negative 
and  a  hermitian  operator  need  not  necessarily  have  real  eigenvalues  which  is 
what  we  require.  For  the  component  of  the  electromagnetic  field  we  thus  impose 

=  <:y}\Ajc{x)  rj  \y)y     j 
or 

[Aj,{x),rj]^0;     A:  ==  1,  2,  3  (13) 

and 

<ip\rjAi(x)\y)y  = —(y)\A^(x)rj\ipy  (14) 

or 

{A,{x),rj}  =  0.  (15) 

The  auxiliary  condition  in  the  Gupta-Bleuler  formalism  will  be 

[a<3)(fe)  +  ^a(4)(fe)]|  t^>  =  0  (16) 

both  the  operators  appearing  being  annihilation  operators  now.  In  x- space,  the 
condition  will  be  dA^'^Hx) 


8x. 


\W>  =  0,  (17) 


where  A^^^x)  stands  for  the  negative  frequency  part  of  A^{x).  If  we  write,  as 
before,  an  arbitrary  state  vector  as 

w>=  \wT>n\^u>,  (18) 

k 


1  P.  A.  M.  DiRAC,  Proc.  Boy.  Soc.  A  180,  1  (1942). 
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then  equation  (16)  contains  only  a  single  condition  for  the  quantities  |0fc>. 
As  the  fundamental  system  of  solutions,  we  can  choose 

|0(O)>  =  |o,O> 

|0(i)>  =  |l,O>  +  t|O,  1> 


|<2><">>=  i(ty|/(^)|n-r,r>  (19) 

with  the  orthogonahty  condition 

<0(n)  I  ^  I  0n'y  _  Q      i-Qj.      ^^  ^,  J20) 

Also  the  normalization  condition  gives 

«?»„|^|0''>=  J  (-!)'(  Md„„  (21) 

r  =  0  \  ^  / 

which  means  that  none  of  the  norms  is  negative  and  only  the  state  |0^^^>,  with 
no  longitudinal  or  scalar  photon,  has  a  norm  different  from  zero.  Thus  in  the 
Gupta-Bleuler  formalism  the  state  vector  |0<<^^>  and  the  state  vector  |0<o)> 
+  i7C'^"^(/c)  |^^"^(fe)>  (where  (7<~>(fc)  are  arbitrary  coefficients)  are  equivalent. 

n#:0 

This  ensm-es  that  we  will  always  have  to  do  with  normalizable  state  vectors. 
Further  if  we  evaluate  the  expectation  value  of  the  Hamiltonian  in  the  state  j^>, 
it  is  found  to  be  equal  to  the  energy  of  the  transverse  photons  alone  and  indepen- 
dent of  the  mixing  of  the  longitudinal  and  scalar  photons. 

We  thus  see  that  the  Gupta-Bleuler  method  which  uses  all  the  four  components 
of  the  electromagnetic  field  on  the  same  footing  and  thus  always  exhibits  the 
relativistic  invariance  explicitly  provides  a  satisfactory  way  of  quantizing  the 
electromagnetic  field. 

8.  A  RE-EXAMINATION  OF  THE  EQUIVALENCE  OF  THE 
FEYNMAN  AND  S-MATRIX  FORMALISMS 

It  is  well-known  that  the  results  of  quantum  mechanics  are  meaningful  only 
under  a  probabilistic  interpretation  of  the  squares  of  the  modulus  of  complex 
quantities  like  the  wave  functions  or  matrix  elements.  The  principle  of  super- 
position however  applies  only  to  the  complex  amplitudes  occurring  in  the  theory 
and  not  to  the  positive  definite  quantities,  the  squares  of  their  moduli,  as  in 
classical  probability  theory.  It  has  been  a  long  held  belief  of  the  author  that 
as  long  as  this  principle  is  safegimrded,  the  methods  of  stochastic  theory  should  find 
direct  application  in  the  study  of  quantum  mechanical  systems  evolving  with  time} 
This  belief  was  encouraged  by  the  possibility  that  the  Feynman  formalism  can  be 
viewed  from  a  new  angle  by  suitably  imbedding  the  methods  of  inverse  probability 


^  A  semi-expository  paper  entitled  "Stochastic  methods  in  quantum  mechanics"  is  to  be 
published  shortly. 
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into  the  study  of  a  time-dependent  electrodynamic  system.  The  faith  in  this 
approach  was  further  strengthened  by  the  ''deduction  of  the  well-known  expres- 
sions for  field  operators  through  perturbation  expansions  of  older  quantum 
mechanics".  It  was  felt  therefore  that  the  equivalence  of  the  Feynman  and  field 
theoretic  methods  must  be  proved  in  a  direct  and  transparent  manner  without 
recourse  to  complex  algebraic  methods  and  notation  hitherto  used.  Such  a  proof 
is  now  presented  1  and  in  order  to  realise  the  simplicity  of  the  new  approach  in 
comparison  with  the  older  methods,  it  is  found  necessary  to  re-examine  in  detail 
the  subtle  variations  in  the  meaning  of  the  terms  such  as  "intermediate  states", 
*' virtual  particles",  "kernel  functions",  "propagators"  and  "perturbations". 
We  also  need  some  simple  lemmas  from  matrix  and  probability  theories.  Thus 
we  divide  this  appendix  into  four  parts : 

(i)  Mathematical  preliminaries  dealing  with  lemmas  in  matrix  and  inverse 
probability  theories, 

(ii)  A  new  viewpoint  regarding  the  kernel  function  formalism  of  Feynman, 

(iii)  Interaction  term  in  field  theory, 

(iv)  Derivation  of  the  Feynman  expansion  from  the  /S-matrix  formalism. 

IVIathematical  preliminaries 

Let  jr(0  be  a  column  vector  of  n  components  n{i',  t)  with  i  =  1,2,  .  .  .,n 
varying  with  t  according  to  the  equation 

^'*'  =  [4(«)]«W,  (1) 

where  [A  {t)\  is  a  matrix  function  of  t.  By  a  solution  of  the  equation  we  mean  the 
determination  of  jr(^2)  given  jt(^i)  when  ^2  —  h  i^  finite.  From  the  infinitesimal 
transformation  jt{t  +  A)  =  {l  +  [A  (t)]  A}  jt(t)  +  O(A^)  (2) 

A  being  positive,  the  solution  jt(^2)  for  ^2  >  h  i^  obtained  on  iteration,  as 

:^{h)=  UAt2,h)Mh)  (3) 

where  Uj^  (^2 ,  t^)  has  the  integral  expansion 

1  +  I  ^  (Ti)  dTi  +  /   J  A  (T2)  A  (Ti)  dTa  dzj  +  •  •  • 

...  +  /    /.../j(Tj...^(Ti)dT,...dTi  (4) 

and  the  time  variables  in  the  integrand  of  the  n-th  order  are  such  that 


^  An  inchoate  form  of  this  proof  was  suggested  earlier  by  the  author  in  association  with 
N.  R.  Ranganathan  and  R.  Vasudevan.  But  at  that  time  a  unique  prescription  for  ordering 
the  operators  in  the  interaction  term  was  not  available  and  the  validity  of  applying  stochastic 
methods  was  not  clear. 
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However  (2)  is  also  valid  when  A  is  negative  so  that  nitz)  when  t^  <  t^  can  still 
be  obtained  in  terms  of  :t(^),  i.e.  we  trace  back  the  evolution  of  jr(0  from  t^  to  t^.^ 
In  this  case,  we  have 

UA{k>  y  =  1  4-  (-  1)  /^(Ti)  dTi  +  (-  \?j  fA{r^)A{r,)  dr^dTi  +  •  •  • 

...  +  (-ir/  f'"fA{T,)A{r,.,)...A{r,)dr,,,..dT,  (6) 

where 

It  is  to  be  noted  that  the  integration  in  this  expansion  is  from  ^g  or  t,  to  t^ .  If 
we  wish  to  retain  the  same  form  for  U^(t2,  t^)  in  both  cases  we  can  drop  the 
factor  (—1)"  in  the  above  expansion  and  integrate  from  t^  to  ^g  and  order  the 
variables  T„,Tn_i,  ..  .,Ti  in  the  direction  t^  to  t^  which  implies  that 

^1  <  Ti  <  Tg  •  •  •  <  T„  <  ^2      if      ^2  <  ^1 

and 

^2  >Tn  >T„_i  •  •  •  >Ti  Xi      if      <2>^1- 

However  if  we  wish  to  have  the  upper  limit  greater  than  the  lower  limit  we 
have  to  use  the  form  (5).  The  factor  (—1)  may  be  ascribed  to  A  (t),  i.e.  we  replace 
every  ^  (t)  in  (4)  by  — A  (t)  and  alter  the  limit  of  integration  to  obtain  (6). 

We  now  consider  the  equation 

-|J'"=[^  +  B(«)]^W.  (7> 

The  solution  for  t^  >  ^i  is 

:!t{t2)=U{t,J,)jt{t,)  (8) 

where 

U{t^,  t,)  =  UAk,  h)  +  /  UAh.  Ti)  B{T^)  U^Ti,  t,)  dTj  +  •  •  • 
ti 

'''-^lf''-fUA{t2,rn)B{TjUAT„,T^.,)...U^,{T2,[r,)B{T,)U^{T^,t,)dTn...dri, 

(9) 

For  ^2  <  ^1  we  merely  replace  ^(t)  by  — B{r)  and  integrate  from  ^g  to  t^.  There 
is  however  no  need  to  replace  Uj^  by  —  C7^  since  for  ^2  >  h  t^®  expansion  of 
UA{'^k>'^k-i)  with  T;fc<T;k_i  itsclf  in  the  integrand  implies  the  change  of  A 
to  — ^.  In  the  above  expansion  we  have  treated  J?  as  a  perturbation  on  A  and  Uj^ 
is  the  transformation  function  in  the  absence  of  the  perturbation  B.  In  quantum 


^  This  is  the  basis  for  dealing  with  questions  relating  to  inverse  probability  in  stochastic 
process  where  tc  {t)  is  a  probability  vector  and  [A  (t)]  a  stochastic  matrix :  see  for  example, 
Alladi  Ramakrishnan,  "Probability  and  Stochastic  Processes",  Handbuch  der  Physik,  Vol.  Ill, 
Springer- Verlag  (1959). 
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mechanics  B  corresponds  to  ^i^t  ^^^  ^  to  Hq.  HA  is  time -independent  and  the 
eigenvectors  of  A  are  u^,  U2,  .  .  .,Un  with  eigenvalues  ^1,^2,...,^^  then 
^Aih)  h)  =  e"^('^~^^^  can  be  recognized  to  be 


A  new  yiewpoint  regarding  the  kernel  function  formalism  of  Feynman 

We  shall  now  apply  the  results  of  the  previous  section  to  the  quantum  electro- 
dynamics involving  an  electron.  In  chapter  II  we  arrived  at  a  perturbation  ex- 
pansion for  the  matrix  element  bf^  in  the  case  of  the  non-relativistic  Schrodinger 
equation  as 

b,iih,to)  =  »,i  +  /d«i   /  d'x,ipf{l)  H'{l)y,i{l)  +  •  •  • 

t  tn  f  a  +  00  +00 

+  /d^.^/d^^_i.../d^i    /    d^X^...    f    d^X^ 

t„  t^  t,  -00  -  00 

i;[V),Wff'(«)y„._.(»)V*™„_.(»-l)fl'(»-l)...y*„,(l)ff'(l)v.ai)],    (10) 

where  H'{k)  is  the  perturbation  at  k,  f  and  i  refer  to  the  final  and  initial  states 
and  m]c  refers  to  the  intermediate  states,  the  sum  ^  running  over  all  mj^ .  While 

the  integration  over  each  space  variable  extends  from  — 00  to  +00,  the  time 
integration  is,  of  course,  ordered.  The  term 

SWm,C^+'^)WmM  (11) 

occurring  in  the  integrand  is  identified  to  be  the  kernel  iT  (A;  +  1,  ^).  In  extending 
the  above  result  to  the  Dirac  equation,  it  was  shown  that  a  modified  kernel  was 
necessary.  We  shall  now  view  this  modification  from  a  new  angle.  We  postulate 
that  in  the  scattering  of  a  Dirac  particle  if  a  transition  occurs  from  a  positive 
energy  state  \p^^^  at  tj^  to  a  positive  energy  state  ip^j^  at  t^-\-  A,  we  assume 
that  this  state  is  propagated  to  tj^^i  with  tj^^i^  t^.  li  on  the  other  hand  \p^^  (k) 
is  a  negative  energy  state,  we  assume  that  it  is  the  result  of  a  transition  due  to 
a  perturbation  acting  at  vertex  (k  +  \)  with  t^^^<^tj^m.  the  time  interval  between 
^A;  + 1  —  ^  ^J^d  h  •  We  therefore  trace  back  the  state  from  tjg  to  tj^^i  and  then  to 
h+i  — zl,  i.e.  to  a  state  before  the  perturbation.^  If  this  state  is  still  a  state  of 
negative  energy,  we  continue  to  trace  it  back  and  if  it  has  positive  energy  we 
study  its  evolution  forward  in  time.  When  we  trace  back  to  tji._^.i  — A  we  replace 


^  No  interesting  consequence  follows  if  we  consider  it  to  be  the  result  of  a  perturbation  at 
t^.  since  that  has  been  considered  in  the  transition  from  ipmic_i  to  ipnik-  What  we  are  doing  is 
immediately  justifiable  if  we  interpret  the  procedure  in  the  Dirac  hole  theory.  A  transition 
to  a  negative  energy  state  implies  the  filling  of  the  hole  which  must  be  in  existence  at  that 
time.  We  then  ask  when  was  this  hole  created? 
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the  perturbation  H'{k+  1)  by  —H*{k-\-  1).  Thus  we  use  the  kernel  defined  as 
K{k+\,k)=  2;y>{k+l)ip{k)     for     h^^>h 

B>0 

and 

=  Uvi^+l)ipiJc)     for     h^^<h  (12) 

and  (Utach  the  positive  or  negative  sign  to  the  perturbation  a/  ^t  +  i  according  as  E 
is  positive  or  negative.  On  shifting  the  negative  sign  to  the  negative  energy  part 
of  the  kernel  we  obtain  the  Feynman  kernel. 

The  above  considerations  would  be  very  formal  except  for  the  fact  that  it 
leads  to  a  unique  prescription  for  writing  the  interaction  term  in  the  language 
of  quantum  field  theory. 


Interaction  term  in  quantum  field  theory 

It  has  long  been  realised  that  since  the  interaction  is  written  as  a  product  of 
field  operators  characterized  by  the  same  space-time  point,  there  is  an  ambiguity 
in  **time  ordering"  of  such  operators,  which  has  to  be  resolved  before  applying 
algebraic  methods  based  on  Wick's  theorem.  In  quantum  electrodynamics 
we  have  rr         -  a 

where 

W  =  2  [bpMp)  e*^-^  +  d;  v(p)  e-*^T  (13) 

p 

where  u{p),  v{p)  are  the  electron  and  positron  spinors  respectively  and 
v(p,  E)  =  u( — p,  —E)  when  E  is  positive.  For  convenience  in  notation  tp  can  be 

"""**"*'  m=2{K(k)  +  dl(k)l  (14) 

P 

where  hj,  {k)  denotes  that  the  usual  spinor  and  the  exponential  terms  corresponding 
to  the  space-time  vertex  (k)  are  attached  to  the  operator  h^ .  Considering  only 
fermion  operators  and  ignoring  the  photon  operator  A^  for  the  present,  the  bi- 
linear term  is  ^  ^.*       7  ^  ^  .7         <j*  n 

fv^SlK  +  ^plSlW  +  dl^ 

p  p' 

=  2  [^;  V + h  4'  +  4  4'  +  4  vj  •  (i^) 

pp' 

The  operators  correspond  to  the  same  space-time  point  and  since  we  know  that 
at  any  vertex  one  of  the  four  fundamental  processes  can  occur,  the  operator 
should  occur  in  pairs  but  not  necessarily  in  the  order  prescribed  by  iptp.  In  fact 
we  should  provide  a  unique  prescription  for  the  choice  of  the  correct  order  and 
this  is  done  if  we  apply  the  arguments  of  the  previous  section. 

The  process  of  pair  annihilation  at  t  is  interpreted  as  the  transition  of  a  posi- 
tive energy  electron  at  <  to  a  negative  energy  state  at  t  +  A,  the  perturbation 
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acting  in  the  interval  A  and  hence  the  electron  destruction  operator  should  be 
placed  to  the  right  of  the  positron  destruction  operator  dp  in  the  interaction  term. 
In  the  case  of  pair  creation  in  the  interval  between  t  —  A  and  t,  we  trace  the 
negative  energy  state  of  the  electron  at  t  back  to  a  positive  energy  state  at  ^  —  A 
so  that  in  this  case  d^  is  placed  to  the  left  of  6|, .  For  electron  and  positron  scatter- 
ing the  creation  operators  are  naturally  placed  to  the  left  of  the  annihilation 
operators.  On  the  basis  of  the  above  arguments,  we  postulate  that  the  interaction 
term  at  a  space-time  point  k  should  read  (ignoring  boson  operators  and  y-matrices) 

Pi  ■  Pi 

i=l,2,3,4 

=  ^^W  (16) 

<P}   « 

where  the  bracket  [k]  contains  one  of  the  four  types  of  terms  b^b,  d^  d,  d^b^ 
+  d^b^  and  the  sum  over  oc  indicates  the  sum  over  all  the  four  such  terms.  The 
sum  over  <29>  means  a  sum  over  the  momenta  Pi,  pi-  This  interaction  term  differs 
from  (15)  in  that  the  positron  creation  operator  d'^  stands  to  the  left  of  d  or  b^. 

Derivation  of  the  Feynman  expansion  from  the  8-matrix  formalism 

On  the  basis  of  the  above  interaction  term,  it  is  possible  to  establish  the  equi- 
valence between  Feynman  and  field  theoretic  perturbation  formalisms  in  a 
rigorous  and  straightforward  manner.  The  integrand  of  the  n-th  order  term  in 
the  /S-matrix  expansion  (ignoring  the  photon  operators  for  the  present)  can  be 

written  as  H^^^  =  2JU[n][n  -  I] .  .  ,[1]  (17) 

<p>  « 
where  the  time  variables  are  ordered, 

i.e.     ^n>^n-i  •  •  •  >^2>^l- 

Considering  only  electron  scattering  let  us  take  the  matrix  element  ofH^^^  between 
initial  and  final  one  particle  states,  say  electrons  of  momenta  p^  and  jpg  respectively 

„<6^Jfl<«>i6;.>„,  (18) 

where  no  integration  over  space  and  time  variables  is  as  yet  performed.  We  shall 
now  show  how  this  reduces  to  the  integrand  of  the  Feynman  expansion  in  an 
n-ih  order  process.  The  extension  to  any  other  electrodynamic  process  follows 
without  any  change  in  principle.  We  have  ignored  for  the  moment  the  photon 
operator  A^^  the  inclusion  of  which  will  be  discussed  presently. 

We  are  interested  only  in  the  terms  in  H^'^^  which  give  a  non-vanishing  contri- 
bution to  the  matrix  element.  We  shall  call  the  product 

[»][»-l]...[l]  (19) 
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a  typical  term^  of  H^**\  We  now  adopt  the  following  procedure  in  rearranging 
the  terms.  Since  the  initial  electron  must  disappear,  some  bracket,  say  [i]  should 
contain  bp^  and  we  move  this  bracket  to  the  left  of  6j^ .  This  does  not  involve 
a  change  in  sign  since  we  move  the  bracket  as  a  whole  and  not  a  single  operator. 
This  bracket  [t]  will  in  addition  contain  a  b^.  or  dp>  to  its  left.  If  there  is  a  6j/ 
then  another  bracket  say  [/]  with  <,•  >  t^  should  again  contain  a  bj,,  and  we  move 
[j]  to  the  left  of  [»].  The  arrangement  would  look  like 

[n]  .  .  .  [;  +  1][;  -!]...[»+  l][i  -  1]  .  .  .  [/][*]  6+^  >  o  (20) 

where  ^         ^  ^         ^ 

[;■][»]  bl>o  =  [bU  ()■)  6,:(/)][6;.'(i)  6,,(i)]  &;>. 

or  =[dpi{j)b^M[f>Uii)kiiM,yo-  (21) 

In  either  case  we  have  the  term 

=  S'^{Pi)Hv'i)  e-^>'(^^-^'>(6^j6;j).  (22) 

P[ 

Since  bp-W^,^  is  equal  to  imity  we  recognize  the  above  to  be  the  Feynman  kernel 

If  on  the  other  hand  [i]  contains  the  destruction  operator  of  a  positron  dp>^{i) 
to  the  left  of  bp^  (i)  there  should  be  another  bracket  [Ic]  with  tj^  <  t^  which  must 
contain  d^'^{k)  since  the  positron  should  have  existed  before  it  could  be  anni- 
hilated. We  now  shift  [A;]  to  the  left  of  \i\  so  that  the  arrangement  now  reads 

M  .  .  .  [»•  +  l][i  -  1]  .  .  .  [fc  +  l][i;  -  1]  .  .  .  [k][i\  6;>„  (23) 


where 


=  -[d^;(fc)d;;(fc)]K;W6p,W] 


or 


K;(i)6;;(i)][d,;W6j,(i)]=  -[6;;(fc)4;(*;)][d^;(i)6^,(»)].  (24) 

In  either  case  we  have  the  term 

d;:(i)dj,;(i).  (25) 

On  simiming  over  the  7>J  we  get 

E>0 

-Zv  (K)  V  {p[)  e-  ipi  •  (^*  -  ^i)  (4^  dpO     with     h  <  ti ,  (26) 

E<0 


^  In  stochastic  theory  we  speak  of  a  "typical  realization"  or  "typical  realizable  sequence 
of  events"  and  probabilities  are  assigned  to  each  such  sequence.  Here  integration  over  space 
and  time  implies  a  superposition  of  these  amplitudes.  The  integral  amplitude  vanishes  unless 
there  is  energy  momentum  conservation.  But  that  is  no  reason  why  we  should  not  speak  of 
the  amplitude  for  a  typical  sequence  of  events  corresponding  to  n  space— time  points. 

For  a  detailed  comparison  of  the  concept  of  realization  in  quantum  mechanics  and  classical 
probability  theory,  see  Alladi  Ramakrishnan  and  N.  R.  Ranganathan,  T.  Math.  Mech.  In  press. 
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which  is  the  Feynman  kernel  for  the  propagation  of  the  negative  energy  state 
since  dp^d^^  is  unity. 

The  procedure  can  be  repeated  and  since  we  have  uniquely  defined  the  mode 
of  arrangement  for  the  fermion  operators  in  a  manner  such  that  the  operators 
are  ordered  in  the  Feynman  sequence  we  obtain  the  integrand  of  the  Feynman 
expansion.  In  this  procedure  we  note  that  if  both  the  creation  and  destruction 
operators  for  a  certain  momentum  and  energy  are  contained  in  the  interaction 
terms,  they  contribute  to  the  kernel.  Inside  the  interaction  terms,  we  look  only 
for  destruction  operators  corresponding  to  the  initial  state  and  creation  operators 
for  the  final  state.  We  wish  to  point  out  that  our  method  differs  from  the  hitherto 
available  algebraic  methods  in  two  important  respects. 

(1)  We  do  not  just  move  each  operator  separately  but  the  operators  correspond- 
ing to  one  vertex  together,  and 

(2)  We  insist  that  d^  should  always  be  to  the  left  of  d  or  6+  at  any  vertex. 
We  now  point  out  at  the  risk  of  a  pedagogic  explanation  that  b^{l)  represents 

creation  of  a  real  particle  of  momentum  at  the  space-time  point  (1).  A  typical 
term  therefore  represents  the  creation  of  a  real  particle  at  every  space-time 
point.  The  question  of  energy  momentum  conservation  does  not  arise  when  there 
is  no  integration  over  space  and  time,  since  a  bracket  merely  contains  a  product 
of  wave  functions  of  real  particle  states.  If  the  integral  is  to  be  non- vanishing, 
energy  conservation  must  of  course  be  satisfied.  Thus  our  rearrangement  has 
resulted  in  obtaining  the  integrand  in  the  Feynman  formahsm  and  not  the 
integrated  matrix  element.  In  the  kernel  we  have  a  product  of  two  terms,  summed 
over  all  p  or  the  sum  can  be  replaced  by  an  integral  as 

{p)u{p)e-'P-(''^-^^^d^p.  (27) 


/ 


2E„ 


The  kernel  can  also  be  treated  as  an  integral  over  four-dimensional  momentum  p 

1 

(not  corresponding  to  any  real  particle)  weighted  by  — ^ ^ 

p       m 

K{2,  1)  =  J'A^J^^I^ dV.  (28) 

Thus  the  kernel  can  be  regarded  as  a  sum  over  wave  functions  with  such  four 

momenta   with   amplitude  — ^ ^.  Such   a  dual  explanation  is  not  possible 

when  we  take  the  field  operators  singly  and  not  in  pairs. 

We  would  however  prefer  to  think  of  the  kernel  as  representing  the  sum  over 
the  momenta  of  real  particles.  When  we  now  integrate,  say  over  space-time 
points,  the  kernel  connecting  (1)  and  (2)  is  transformed  by  the  product  of  two 
exponential  functions  like  e'^'^*'"^-^  and  e**'<^«~^»^  where  p  is  the  four  momen- 
tum of  a  real  electron  and  q  that  of  a  photon.  This  is  equivalent  to  taking  the 
transform  with  momentum  arguments  p  +  q  which  represents  a  virtual  electron 
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since  the  energy-momentum  relationship  is  not  satisfied  for  p-{-  q.In  momentum 

space  we  speak  of  the  propagator —  of  a  virtual  particle,  but  there  is 

no  meaning  in  speaking  of  kernel  functions  iji  configuration  space  of  a  virtual 
particle.  It  is  convenient  to  regard  the  sum  over  with  real  particle  wave  functions 
and  the  "intermediate"  states  in  the  integrand  as  real  particle  states.^ 

The  introduction  of  the  boson  operators  does  not  introduce  any  difficulty  in 
this  argument  since  these  operators  commute  with  each  other.  Once  the  Feynman 
arrangement  has  been  obtained  we  need  only  to  impose  the  restriction  that  in  a 
typical  term  only  annihilation  operators  corresponding  to  incident  bosons  and 
creation  operators  corresponding  to  emergent  bosons,  should  occur. 

Since  we  have  given  a  unique  prescription  for  ordering  the  operators,  these 
considerations  can  be  extended  without  any  change  of  principle  to  the  case 
when  the  incident  and  emergent  system  consists  of  more  than  one  particle. 

In  getting  (26)  we  have  ignored  the  contribution  from  the  ^-function  when  dj, 
crossed  d^  while  moving  dp  to  the  extreme  right.  If  we  wish  to  take  into  con- 
sideration this  contribution  we  should  use  the  ordering  of  the  operators  as  in  the 
usual  interaction  y^y^y.  We  shall  now  illustrate  this  for  the  case  which  leads 
to  the  negative  energy  part  of  the  Feynman  kernel^  since  this  alone  involves  a 
difference  in  procedure  between  the  two  methods. 

If  a  bracket  [i],  which  now  contains  bilinear  fermion  operators  in  the  order 
determined  by  ipy),  contains  a  bp^  with  a  dp-  attached  to  its  left,  we  obtain 

[n],..[i  +  l]dp-{i)\i-l]...[l]bp^blyo 

where  we  have  shifted  bp^  (the  operators  being  detached  from  their  wave  func- 
tions) through  the  intervening  brackets,  which  results  in  no  change  of  sign  since 
none  of  the  brackets  contains  either  a  bp^  or  a  b^^ .  As  before  we  argue  that  there 
should  be  another  bracket  [k]  with  ^^  <  t^  which  should  contain  a  d^,  and  shifting 
to  its  left  we  have 

M . . .  [i  + 1]  [2  - 1] . . .  dp>  (/)  [b;.  (k)  d;>{k)] . .  .>o 

or 

W  .  .  .  [^  +  1]  [*•  -  1]  .  .  .  dp-  (^•)  [d,-  {k)  d;,  {k)]  .  .  .>o 

In  either  case  we  obtain 

-dp-{i)dl,{k)yo 

As  far  as  the  wave  functions  attached  to  the  operators  are  concerned,  we  have 

[Vp-{i)Up^{i)][Up"{k)Vp'{k)] 
=  Up"  {k)  Vp' {k)  Vp' {i)  Up^{i) 


1  This  of  course  should  be  distinguished  from  the  the  real  intermediate  states  in  the  matrix 
element  which  correspond  to  the  singularities  in  the  propagator  in  momentum  representation. 

2  The  proof  of  equivalence  using  the  usual  interaction  has  been  done  in  collaboration  with 
R.  Thunga  and  T.  K.  Radha. 
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and  with  the  negative  sign  obtained  from  the  commutation  of  the  operators  we 

=  -y;u_E{k)u_E{i)e-^^'^-''^-^^^     with     h<ti, 
p' 

which  gives  an  element  of  the  negative  energy  part  of  the  Feynman  kernel. 

9.  ON  THE  CONNECTION  BETWEEN  SPIN  AND  STATISTICS 

We  give  below  the  proof  of  Luders  and  Zumino^  on  the  connection  between 
spin  and  statistics.  The  theorem  on  the  connection  between  spin  and  statistics 
states  that  particles  with  integral  spin  obey  Bose  statistics  and  particles  with 
half-integral  spin  obey  Fermi  statistics.  Fields  corresponding  to  particles  with 
integral  spin  are  to  be  quantized  with  minus  commutation  relations  and  those 
corresponding  to  particles  with  half-integral  spin  are  to  be  quantized  with  plus 
commutation  (anti-commutation)  relations.  The  connection  between  spin  and 
statistics  was  first  proved  by  Pauli  for  non-interacting  fields. ^ 

Hermitian  spin-zero  field 

Let  q){x)  be  a  hermitian  spin  zero  field  in  the  Heisenberg  representation.  We 
first  postulate : 

I.  The  theory  is  invariant  with  respect  to  the  proper  inhomogenous  Lorentz  group. 

It  follows  that  the  expectation  value  i(p(x)(p(y)yQ  between  physical  vacuum 
states  is  an  invariant  function  of  the  difference  four- vector 

I,. -=  ^/.  — 2//Z-  (1) 

Therefore 

<.v(x)'p(y)y,-  f(S)  (2) 

where  /(|)  depends  only  upon  ^^|^  if  |  is  space-like  whereas  if  it  is  time-like,  /(|) 
can  also  depend  upon  whether  ^^,  points  into  the  future  or  the  past  light  cone. 
It  follows  from  (2)  that 

<[99(;r),99(2/)]>o- 0;     |  (space-like).  (3) 

We  further  postulate  : 

II.  Two  operators  of  the  same  field  at  points  separated  by  a  space-like  interval  either 
commute  or  anti-commute. 

We  have  therefore  only  to  show  that  the  assumption 

<[99(:r),9?(2/)]+>o=  0         (^space-like)  (4) 


1  G.  Luders  and  B.  Zumino,  Phys.  Rev.  110,  1450  (1958). 

2  W.  Pauli,  Phys.  Rev.  58,  716  (1940).  See  also  J.  Schwinger,  Phys.  Rev.  82,  914  (1951); 
91,  713  (1953).  F.  J.  Belinfante,  Physica  6,  870  (1939);  W.  Pauli  and  F.  J.  Belinfante, 
Physica  7,  177  (1940);  N.  Burgoyne,  Nuovo  Cim.  8,  607  (1958). 
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leads  to  a  contradiction.  Equations  (3)  and  (4)  give 

<<P  i'^)  <P  {y)>o  =-0         (I  space-like) .  (5) 

We  also  postulati^ : 

III.  The  vacuum  is  the  state  of  loivesl  energy.  It  follows  by  the  method  of  analytic 
continuation  used  by  Hall  and  Wightman^  that  equation  (6)  holds,  not  only 
for  space-like  |,  but  for  all  |. 

Our  fourth  postulate  is : 

IV.  The  metric  of  the  Hilbert  space  is  positive  definite. 

It  follows  from  (5)  that 

(p(x)Q^(y  (6) 

where  Q  is  the  physical  vacuum.  We  finally  postulate : 

V.  The  vacuum  is  not  identically  annihilated  by  a  field. 

Since  equation  (6)  contradicts  this  postulate,  Eq.  (4)  is  untenable.  Postulate  V 
seems  essential  in  order  that  a  Hilbert  space  can  be  constructed. 

Hermitian  spin  one-half  field 

Let  y)c(x)  be  a  Majorana  spin  one-half  field  so  that 

where  C  is  the  4x4  charge  conjugation  matrix.  We  consider 

<WP  (^)  Wp  iy)>0  =  C'„<5  r4(J/J  <   fa  (x)  WP  (2/)>0  (8) 

where  the  star  denotes  hermitian  conjugation  in  Hilbert  space.  Let 

The  matrix  rj  is  symmetric : 

VccP='Vpc.'  (10) 

Since  (8)  is  the  fourth  component  of  a  four  vector,  it  follows  from  postulate  I  that 

Vocfi  <wAx)  My)>o  =  iiffi^)  (11) 

where  g{^)  is  an  invariant  function  of  |^,.  From  (10)  and  (II),  it  follows  that 
^«/3<[^«(-^),  My)]+>o  =0         ii  space-like) .  (12) 

As  before,  because  of  postulate  II,  we  have  to  show  that  the  assumption 

Vccfi<bpM^  n{y)]>o  =0         (I  space-like)  (13) 

leads  to  a  contradiction.  From  (12)  and  (13),  we  get 

^«/3<^«(^)  Vpiy)>o  =0         (i  space-like)  (14) 


1  D.  Hall  and  A.  S.  Wightman,  Kgl.  Danske  Vidensk  Selsk.  31,  No.  5  (1957). 
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From  postulate  III,  it  follows  that  (14)  holds  for  all  |.  Because  of  equation  (8) 
and  postulate  IV,  it  follows  that 

ipA^)Q^O  (15) 

This  contradicts  postulate  V. 

Non-hermitian  fields 

We  assume  that  non-hermitian  fields  are  reasonable  concepts  only  if  they  admit 
a  gauge  transformation  of  the  first  kind : 

(p{x)->(p(x)e^'';         (p*{x)->(p*{x)e-^'',  (16) 

similarly  for  spinor  fields  etc.  The  generating  operator  for  the  transformation 
may  represent  some  conserved  quantity  like  electric  charge,  baryon  number, 
strangeness  (in  strong  and  electromagnetic  interactions)  or  lepton  charge  (if  it 
is  conserved).  We  first  consider  a  spin-zero  field  9?  (a;).  From  the  above  postulate, 
it  follows  that  «p{x)<p{y)y,=  «p*(x)<f*{y)y„=0.  (17) 

We  have  to  show  that  the  assumption 

<[99*  (x),  cp  (2/)]+>o  =  0         (^  space-like)      .  (18) 

leads  to  a  contradiction.   From  postulate  I  and  equation  (18)  it  follows  that 

<9?*  {x)  (p{y)  +  (p  {x)  99*  (i/)>o  =0         (I  space-like) ,  (19) 

Using  equation  (17),  we  obtain 

<  ((P  (^)  ±  9^*  (^))  i<P  {y)  ±  (P*  {y))>o  =0         (I  space-like) ,  (20) 

where  both  the  plus  or  both  the  minus  signs  are  to  be  taken  together. 
Postulate  III  makes  equation  (20)  generally  valid.  Since  both  (p{x)  +  (p*{x)  and 
i((p(x)  —(p*{x))  are  hermitian  fields,  postulate  IV  leads  to 

(p(x)Q^(p'^ix)Q=  0  (21) 

which  contradicts  postulate  V. 

For    spin    one-half  fields,    the  argument  is  quite  similar.   The  analogue   of 
equation  (17)  is  ^^^^^^  ^^^^^^^  ^  ^.^^^^  _^(^)>^  ^  ^  ,22, 

The  assumption  to  be  disproved  is 

<bpi  i^)>  W. iy)]>o  =0         (^  space-Hke) .  (23) 

Since  the  left-hand  side  transforms  like  the  fourth -component  of  a  four- vector, 
<Wa  (^)  Wcc  {y)  +  Woe (^)  wt  iy)>o  =^^        (^  space-llke) .  (24) 

Equation  (20)  is  to  be  replaced  by 

r].^<bPA^)±CayVn^)\[My)±^f^6W*iy)]>0-.0,  •  (25) 
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where  the  matrix  C  is  the  inverse  of  7^  and  f  J^  =  ?y«^.  Since  tp^ix)  -f  CayWyi^)  *^"^ 
»(V«(^)  "~  C«yV*(^))  are  Majorana  fields,  we  have 

y>,(x)Q=-^ip,{x)Q=^0  (26) 

which  contradicts  postulate  V. 

10.  PHASE  SHIFT  ANALYSIS  OF  SCATTERING 

We  have  had  occasion  to  deal  with  phase  shifts  in  various  chapters  of  the 
book.  To  define  this  quantity  and  find  its  relationship  with  the  scattering  cross- 
section,  it  is  easiest  to  study  the  quantum  mechanical  scattering  from  a  central 
potential,  the  Schrodinger  equation  for  which  can  be  written  as 

--~V^u  +  Vu  =  Eu  (1) 

where  fj,  is  the  reduced  mass  of  the  system  of  scatter  and  the  scattered  particle. 
The  general  solution  of  this  equation  is  of  the  form 

u{r,e  =  ^Ri{r)Pi{cose)  (2) 

/  =  o 

where  Pi  is  the  Legendre  polynomial  of  order  /.  (Because  of  the  symmetry  about 
the  polar  axis,  there  is  no  dependence  on  the  azimuthal  angle.)  Assuming  that  V 
can  be  neglected  for  r  greater  than  some  distance  a,  the  most  general  form  of 
Ri  (r)  that  is  real  is 

Ri (r)  =r.  Ai[cos  di  ji (k  r)  —  sin  (5/  Ui {k  r)]  (3) 

where  di  is  real  and  ji  and  Ui  are  the  spherical  Bessel  and  Neumann  functions 
respectively,  di  is  the  phase  shift  of  the  l-th.  partial  wave  since  it  is  the  difference 
of  phase  between  the  asymptotic  forms  of  the  actual  radial  function  Ri{r)  and 
the  radial  fiinction  ^'/(A;r)  when  the  scattering  potential  is  absent. 

Ri{r)    ->    {kr)-^sm{kr-iln  +  di).  (4) 

Now  we  can  represent  the  asymptotic  form  of  jll  (r,  6)  also  in  the  form 

u{r,d)    ->    e»*^  +  ^^e**''  (5) 

where  the  first  term  on  the  right  represents  the  incident  plane  wave  corresponding 

to  a  particle  moving  in  the  positive  z-direction  and  the  second  term  to  the 

scattered  wave  which  can  be  only  outgoing,  as  the  scattered  particles  can  move 

only  away  from  the  scatter.  Now  we  have 

00 
Qikz  _  ^ikrcose  =  2J  {21+1)  i^jiikr)  Pi{co8 6) .  (6) 
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Substituting  the  asymptotic  form  of  this  in  '(5)  and  equating  it  to  the  asymptotic 
form  of  (2),  we  obtain  on  equating  the  coefficients  of  e**'*  and  e"***"  on  the  two 
sides  of  the  equation 

OO  (X) 

2  ikfid)  +  i;  (2  ?  +  1)  ii  e-^"^^P^(cos  Q)  =  Z  A  e*'('''^*^">  P^cos  Q)       (7a) 

OO  c» 

2;(2Z  +  )*^ei^'''P^(cos0)  =  i;^fe-<'5^-*^-)P^(cos0),  (7b) 

Since  these  equations  are  true  for  all  values  of  B  and  equation  (7  b)  becomes  on 
using  the  orthogonality  of  the  Legendre  polynomials 

Ai={2l-{-\)i^Qi^i.  (8) 

Substituting  this  in  (7  a)  we  obtain  the  expression  for  the  scattering  amplitude, 
viz. 

OO 

/(0)  =  (2iA;)-^2;(2Z+l)(e2^''^-l)PHcos6)).  (9) 

z=o 


The  differential  cross-section  is  given  by 

1 


G{Q)  ^  |/(0)|2  =  ^(2?H-  l)e*-''^sin6,|P,(cos0)|2  (10) 


and  the  total  cross-section  by 

4  jr      OO 

(T=^-y^2'(2/+l)sin2^/.  (11) 

1^    1  =  0 

We  thus  see  that  the  phase  shifts  completely  determine  the  scattering,  the 
cross-section  vanishing  when  each  of  the  di  is  0°  or  180°.  In  this  sense  we  can 
consider  the  >S^-matrix  as  the  phase  shift  operator  with  eigenvalue  e^^*^  when  d  is 
real,  i.e.  for  pure  scattering  processes.  (For  absorptive  processes  the  phase  shift 
will  be  complex.)  Hence  the  eigenvalue  of  the  T-matrix  will  correspondingly 
be  e**^  sin  d  while  the  reaction  matrix  k,  which  is  connected  with  the  iS-matrix 
by  the  relation 


i  k 

and  has  the  eigenvalue  tan  6.  Thus  we  can  talk  of  the  scattering  eigenphases  of 
a  system. 

If  we  compare  the  expression  (9)  for  0  =  0  (i.e.  forward  scattering)  with  (11), 
we  obtain  the  relation 

<T  =  -*^Im,(0), 

which  is  called  the  optical  theorem  or  the  Bohr-Peierls-Placzek^  relation. 


N.  Bohr  et  ah,  Nature,  Lond.  144,  200  (1939). 
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For  large  angular  momenta  when  the  classical  approximation  is  applicable, 
we  can  write  the  impact  parameter  ^  in  terms  of  I  as 

In 

where  /  is  the  de  Broglie  wavelength.  The  smallest  impact  parameter  is  given 
by  /  ^  0.  If  the  range  of  interaction  is  less  than  X\2n  then  a  particle  with  an 
angular  momentum  other  than  zero  hardly  experiences  scattering  at  all.  The  wave 
function  for  a  particle  with  angular  momentum  /  becomes  zero  like  r^  at  the 
origin.  Hence  the  probability  of  finding  a  particle  with  /  >  0  in  the  region  of  inter- 
actions is  very  small  if  the  range  of  action  of  the  forces  is  much  less  than  A/27r. 
Thus  the  method  of  partial  wave  analysis  should  be  particularly  suitable  for  low 
energy  nuclear  scattering  since  the  nuclear  forces  are  of  short  range. 

It  can  be  demonstrated  that  the  sign  of  the  phase  shift  is  related  to  the  nature 
of  the  potential.  A  repulsive  potential  makes  the  phase  shift  negative  while  an 
attractive  potential  leads  to  a  positive  phase  shift. 


See,  e.g.  L.  I.  Schiff,  Qiuinlvm  Mechanics,  McGraw-Hill,  Now  York  (1949). 
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Table  II 

Electron  distribution  (Electron  primary) 

e{n) 

[e  {n}  y  is  given  in  brackets 


2 

3 

4 

5 

6 

7 

8 

3 

1.98 

1.69 

1.21 

.779 

.475 

.280 

.160 

3.93 

(2.87) 

(1.45) 

(.606) 

(.225) 

(.0782) 

(.0256) 

4 

3.90) 

(4.34) 

3.85 

2.98 

2.11 

1.41 

.897 

(15.2) 

(18.8) 

(14.9) 

(8.90) 

(4.47) 

(1.98) 

(.805) 

o 

6.87 

9.47 

10.2 

9.27 

7.55 

5.67  ; 

4.01 

(47.3 

(89.8) 

(103) 

(86.0) 

(56.9) 

(32.1) 

(16.1) 

6 

11.2) 

18.6 

23.4 

24.7 

22.9 

19.3 

15.1 

(125) 

(345) 

(549) 

(611) 

(524) 

(372) 

(228) 

7 

33.3 

48.8 

58.8 

61.3 

57.5 

49.6 

(1110) 

(2380) 

(3450) 

(3760) 

(3310) 

(2460) 

8 

56.4 

94.2 

128 

149 

154 

146 

(3180) 

(8870) 

(1.63  X  10*) 

(2.22  X  104) 

(2.38  X  104) 

(2.13  X  104) 

9 

171 

259 

.334 

381 

393 

(2.92  X  104) 

(6.71  X  104) 

(1.12  X  105) 

(1.45  X  105) 

(1.54  X  10^) 

10 

296 

496 

703 

874 

979 

(8.73  X  10^) 

(2.46  X  W) 

(4.94  X  10^) 

(7.64  X  10^) 

(9.58  X  10^) 

12 

1580 

2660 

3890 

5070 

(2.51  X  10«) 

(7.10  X  10«) 

(1.51  X  10') 

(2.57  X  10') 

14 

8620 

(7.42  X  10') 

14500 
(2.11  X  108) 

21700 
(4.70  X  108) 

16 

47400 
(2.25  X  109) 

80100 
(6.41  X  109) 
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Table  III 
Electron  distribution  (Electron  primary) 

a^  is  given  in  brackets 


2 

3 

4 

5 

6 

7 

8 

3 

6.48 

4.49 

2.57 

1.37 

.740 

.400 

.216 

(2.55) 

(1.62) 

(1.12) 

(.76) 

(.515) 

(.322) 

(.190) 

4 

21.6 

25.4 

18.6 

12.1 

6.89 

3.76 

2.05 

(6.4) 

(6.6) 

(3.7) 

(3.2) 

(2.42) 

(1.78) 

(1.24) 

5 

63.1 

114 

126 

101 

67.9 

40.6 

22.8 

(15.8) 

(24) 

(23) 

(15) 

(11.0) 

(8.5) 

(6.7) 

6 

163 

424 

650 

698 

584 

414 

261 

(38) 

(79) 

(101) 

(87) 

(60) 

(42) 

(33) 

7 

1360 

2800 

3910 

4140 

3570 

2640 

(250) 

(420) 

(460) 

(380) 

(260) 

(180) 

8 

3900 

1.04  X  10* 

1.86  X  10* 

2.45  X  10* 

2.57  X  10* 

2.26  X  10* 

(720) 

(.15  X  104) 

(.23  X  10*) 

(.23  X  10*) 

(.19  X  10*) 

(.13  X  10*) 

9 

3.45  X  10* 

7.67  X  10* 

1.24  X  105 

1.57  X  105 

1.64  X  105 

(.53  X  10*) 

(.96  X  10*) 

(.12  X  105) 

(.12  X  105) 

(.10  X  105) 

10 

1.03  X  105 

2.82  X  105 

5.53  X  105 

8.36  X  105 

1.03  X  10* 

(.16  X  10^) 

(.36  X  105) 

(.59  X  105) 

(.72  X  105) 

(.07  X  10«) 

12 

2.91  X  10« 

8.05  X  10« 

1.68  X  10' 

2.80  X  10' 

(.40  X  10«) 

(.95  X  10«) 

(.17  X  10') 

(.23  X  10') 

14 

8.53  X  10^ 
1.11  X  10')) 

2.37  X  108 

(.26  X  108) 

5.19  X  108 
(.49  X  108) 

16 

2.56  X  109 
(.31  X  109) 

7.16  X  109 
(.75  X  109) 
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Table  IV 
Electron  distribution  (Electron  primary) 


[e{n}]*-B{n) 

a"' 
— r-r  is  given  in  brackets 
e{n) 


2 

3 

4 

5 

6 

7 

8 

3 

1.3 

1.4 

4.7 

-4.4 

-2.1 

-1.6 

-1.4 

(1.3) 

(.96) 

(.93) 

(.98) 

(1.1) 

(1.2) 

(1.2) 

4 

.57 

.46 

.34 

.54 

1.0 

3.1 

-1.3 

(1.6) 

(1.5) 

(.96) 

(1.1) 

(1.1) 

(1.3) 

(1.4) 

5 

.39 

.30 

.25 

.20 

.22 

.32 

.55 

(2.3) 

(2.5) 

(2.2) 

(1.6) 

(1.5) 

(1.5) 

(1.7) 

6 

.33 

.24 

.19 

.15 

.12 

.12 

.15 

(3.4) 

(4.2) 

(4.3) 

(3.5) 

(2.6) 

(2.2) 

(2.2) 

7 

.23 

.18 

.14 

.10 

.080 

.075 

(7.5) 

(8.6) 

(7.8) 

(6.2) 

(4.5) 

(3.6) 

8 

.23 

.17 

.14 

.10 

.081 

.061 

(13) 

(16) 

(18) 

(15) 

(12) 

(8.9) 

1) 

.18 

.14 

.11 

.083 

.064 

(31) 

(37) 

(36) 

(31) 

(25) 

10 

.18 

.15 

.12 

.094 

.073 

(54) 

(73) 

(84) 

(82) 

(72) 

12 

.16 

.13 

.11 

.089 

(250) 

(360) 

(440) 

(450) 

14 

.15 

(1300) 

.12 

(1800) 

.10 

(2300) 

10 

.14 

(6500) 

.12 

(9400) 
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Table  V 

Photon  distribution  (Electron  primary) 

e{n^} 

o^  is  given  in  brackets 


X 

2 

3 

4 

5 

6 

7 

8 

3 

7.07 

5.62 

3.82 

2.19 

1.34 

.653 

.364 

(1.30) 

(1.26) 

(1.18) 

(.94) 

(.81) 

(.450) 

(.291) 

4 

27.0 

33.1 

28.4 

19.8 

11.6 

6.59 

3.60 

(3.2) 

(3.7) 

(3.7) 

(3.6) 

(2.8) 

(2.29) 

(1.70) 

5 

90.1 

156 

180 

157 

113 

73.0 

41.9 

(10.4) 

(13.9) 

(14) 

(12) 

(11) 

(9.9) 

(8.6) 

6 

280 

623 

906 

1050 

933 

703 

471 

(54) 

(62) 

(62) 

(55) 

(43) 

(39) 

(32) 

7 

2400 

4209 

5890 

6430 

5790 

4510 

(500) 

(320) 

(310) 

(250) 

(160) 

(140)   . 

8 

162  X  102 

.  282  X  102 

373  X  102 

405  X  102 

373  X  102 

(1320) 

(17  X  102) 

(12  X  102) 

(1000) 

(900) 

9 

557  X  102 

119  X  103 

191  X  103 

245  X  103 

261  X  103 

(4930) 

(80  X  102) 

(9  X  103) 

(8  X  103) 

(5  X  103) 

10 

174  X  103 

450  X  103 

860  X  103 

130  X  10* 

159  X  10* 

(168  X  102) 

(35  X  103) 

(49  X  103) 

(5  X  104) 

(2  X  10*) 

12 

495  X  104 

131  X  10^ 

266  X  10-^ 

438  X  10* 

(47  X  10*) 

(14  X  105) 

(17  X  105) 

(17  X  105) 

14 

145  X  10« 
(15  X  10«) 

389  X  10« 
(29  X  106) 

831  X  10* 
(47  X  10«) 

16 

438  X  107 
(39  X  10') 

118  X  10» 
(8  X  108) 
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Table  VI 

Photon  distribution  (Electron  primary) 
e  {n} 
[e  {n)  ]*  is  given  in  brackets 


X 

2 

3 

4 

5 

6 

7 

8 

3 

2.40 

2.09 

1.62 

1.12 

.729 

.450 

.271 

(5.77) 

(4.36) 

(2.63) 

(1.25) 

(.532) 

(.203) 

(.0732) 

4 

4.88 

5.42 

4.97 

4.02 

2.98 

2.08 

1.38 

(23.8) 

(29.4) 

(24.7) 

(16.2) 

(8.87) 

(4.31) 

(1.90) 

5 

8.93 

11.9 

12.9 

12.0 

10.1 

7.88 

5.77 

(79.7) 

(142) 

(166) 

(145) 

(102) 

(62.0) 

(33.3) 

6 

15.0 

23.7 

29.7 

31.6 

29.8 

25.8 

20.9 

(226) 

(562) 

(880) 

(996) 

(890) 

(664) 

(439) 

7 

43.6 

62.3 

74.7 

78.6 

75.0 

66.1 

(1900) 

(3880) 

(5580) 

(6180) 

(5630) 

(4370) 

8 

122 

163 

190 

199 

191 

(149  X  102) 

(265  X  102) 

(369  X  102) 

(395  X  102) 

(364  X  102) 

9 

225 

333 

426 

487 

507 

(508  X  102) 

(HI  X  103) 

(182  X  103) 

(237  X  103) 

(257  X  103) 

10 

397 

644 

900 

1120 

1250 

(158  X  103) 

(415  X  W) 

(811  X  103) 

(124  X  10*) 

(157  X  10*) 

12 

2120 

3420 

4990 

6490 

(448  X  10*) 

(117  X  105) 

(249  X  105) 

(422  X  105) 

14 

114  X  102 
(131  X  10«) 

190  X  102 
(360  X  10«) 

280  X  102 
(785  X  10«) 

16 

632  X  102 
(399  X  10') 

105  X  103 

(110  X  108) 
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Table  VII 
Photon  distribution  (Electron  primary) 


[e{n}Y-e{n} 


8{n] 


is  given  in  brackets 


\2/ 

2 

3 

4 

5 

() 

7 

8 

3 

.39 

.56 

1.2 

7.0 

-4.1 

-1.8 

-1.5 

(.54) 

(.61) 

(.73) 

(.84) 

(1.1) 

(1.0) 

(1.1) 

4 

.17 

.15 

.19 

.30 

.47 

1.0 

3.3 

(.65) 

(.68) 

(.74) 

(.91) 

(.92) 

(1.1) 

(1.2) 

5 

.15 

.11 

.091 

.088 

.12 

.18 

.31 

(1.2) 

(1.2) 

(1.1) 

(.97) 

(1.0) 

(1.3) 

(1.5) 

6 

.26 

.11 

.076 

.057 

0.50 

.060 

.077 

(3.6) 

(2.6) 

(2.1) 

(1.7) 

(1.4) 

(1.5) 

(1.5) 

7 

.27 

.084 

.055 

.040 

.029 

0.032 

(11) 

(5.1) 

(4.1) 

(3.1) 

(2.2) 

(2.1) 

8 

.089 

.064 

.034 

.026 

.026 

(11) 

(10) 

(6.5) 

(5.1) 

(4.9) 

9 

.072 

.051 

.036 

.018 

(24) 

(22) 

(17) 

(8.9) 

10 

.085 

.060 

0.43 

.010 

(55) 

(54) 

(48) 

(13) 

12 

.10 

.071 

.067 

.039 

(220) 

(250) 

(340) 

(250) 

14 

.11 

(13  X  102) 

.081 

(15  X  102) 

.097 

.059 

(17  X  102) 

16 

.078 

(61  X  102) 

(79  X  102) 
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Table  VIII 
Electron  distribution :  New  approach  (Electron  primary) 

e  {n}  is  given  in  brackets 


4 

5 

6 

7 

8 

9 

10 

11 

0.5 

.524 

.750 

.994 

1.521 

1.521 

1.801 

2.092 

2.394 

(1.139) 

(1.338) 

(1.553) 

(1.781) 

(2.019) 

(2.268) 

(2.527) 

(2.796) 

0.6 

.729 

1.053 

1.407 

1.786 

2.188 

2.612 

3.057 

3.526 

(1.252) 

(1.530) 

(1.836) 

(2.163) 

(2.512) 

(2.880) 

(3.269) 

(3.678) 

0.7 

.961 

1.400 

1.888 

2.418 

2.988 

3.598 

4.248 

4.940 

(1.381) 

(1.751) 

(2.162) 

(2.611) 

(3.096) 

(3.616) 

(4.173) 

(4.767) 

0.8 

1.216 

1.790 

2.437 

3.151 

3.931 

4.777 

5.693 

6.679 

(1.524) 

(1.995) 

(2.530) 

(3.123) 

(3.773) 

(4.483) 

(5.254) 

(6.088) 

0.9 

1.492 

2.220 

3.054 

3.989 

5.027 

6.170 

7.423 

8.792 

(1.677) 

(2.261) 

(2.936) 

(4.548) 

(4.458) 

(5.491) 

(6.530) 

(7.670) 

1.0 

1.788 

2.690 

3.740 

4.938 

6.288 

7.797 

9.745 

11.330 

(1.837) 

(2.546) 

(3.378) 

(4.336) 

(5.424) 

(6.649) 

(8.020) 

(9.546) 

1.1 

2.103 

3.199 

4.497 

6.003 

7.726 

9.682 

11.887 

14.360 

(2.003) 

(2.846) 

(3.856) 

(5.039) 

(6.406) 

(7.970) 

(.9747) 

(11.752) 

1.2 

2.433 

4.735 

5.326 

7.189 

9.356 

11.851 

14.703 

17.940 

(2.172) 

(3.159) 

(4.366) 

(5.806) 

(7.498) 

(9.466) 

(11.733) 

(14.328) 

1.3 

2.779 

4.327 

6.226 

8.502 

11.189 

14.329 

17.965 

22.145 

(2.343) 

(3.485) 

(4.908) 

(6.638) 

(8.705) 

(11.147) 

(14.002) 

(17.310) 
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Table  IX 
Electron  distribution:  New  approach  (photon  primary) 

8{N} 

s  {n}  is  given  in  brackets 


X 

4 

5 

6 

7 

8 

9 

10 

11 

0.5 

.710 

.766 

.822 

.879 

.938 

1.000 

1.063 

1.128 

(.682) 

(.746) 

(.806) 

(.865) 

(.925) 

(.987) 

(1.050) 

(1.115) 

0.6 

.063 

.951 

1.044 

1.141 

1.243 

1.349 

1.459 

1.573 

(.817) 

(.917) 

(1.015) 

(1.114) 

(1.216) 

(1.323) 

(1.432) 

(1.546) 

0.7 

1.025 

1.157 

1.298 

1.450 

1.611 

1.781 

1.959 

2.146 

(.955) 

(1.102) 

(1.249) 

(1.403) 

(1.564) 

(1.733) 

(1.910) 

(2.095) 

0.8 

1.198 

1.384 

1.590 

1.813  ■ 

2.054 

2.311 

2.584 

2.872 

(1.097) 

(1.300) 

(1.511) 

(1.736) 

(1.975) 

(2.229) 

(2.499) 

(2.784) 

0.9 

1.382 

1.636 

1.922 

2.237 

2.582 

2.954 

3.354 

3.781 

(1.243) 

(1.513) 

(1.803) 

(2.117) 

(2.457) 

(2.823) 

(3.216) 

(3.637) 

1.0 

1.579 

1.914 

2.298 

2.729 

3.206 

3.727 

4.293 

4.906 

(1.391) 

(1.741) 

(2.125) 

(2.550) 

(3.016) 

(3.526) 

(4.080) 

(4.680) 

1.1 

1.788 

2.219 

2.722 

3.295 

3.935 

4.645 

5.426 

6.281 

(1.541) 

(1.982) 

(2.479) 

(3.037) 

(3.659) 

(4.349) 

(5.109) 

(5.492) 

1.2 

2.011 

2.553 

3.196 

3.939 

4.782 

5.728 

6.781 

7.947 

(1.694) 

(2.237) 

(2.864) 

(3.580) 

(4.392) 

(5.304) 

(6.322) 

(7.452) 

1.3 

2.246 

2.916 

3.723 

4.668 

5.756 

6.992 

8.385 

9.947 

(1.847) 

(2.505) 

(3.280) 

(4.182) 

(5.220) 

(6.403) 

(7.741) 

(9.243) 
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12.  RESONANCE  STATES  OF  STRONGLY  INTERACTING  PARTICLES 

(1)  The  .T  -  X  resonance 

The  existence  of  this  resonance  was  first  suspected  from  a  study  of  the  iso- 
vector  part  of  the  nucleon  electromagnetic  form  factor.^  Subsequently  an  ex- 
perimental study  2  of  the  reactions 

has  revealed  the  existence  of  this  resonance  via  a  peak  in  the  0- value  distribution 
of  the  two  pions.  The  mass  of  this  resonance  is  765  MeV  with  a  half- width  of 
75  MeV.  Experiments  also  reveal  that  the  quantum  numbers  of  the  resonance 
are  /  =  1  and  J  =  1. 

(2)  The  3;r  resonances 

(a)  Experiments  3  tend  to  indicate  a  spectrum  anomaly  of  He^  produced  in 
the  collision  of  protons  with  deuterium  in  the  form  of  a  peak  in  the  region  cor- 
responding to  double  pion  production.  The  isotopic  spin  of  the  particle  which 
this  suggests  has  been  measured  to  be  /  =  0.  It  is  also  suggested*  that  the  particle 
has  J  =  1  since  it  has  never  been  observed  in  A'-decay.  Thus  it  seems  to  be 
a  particle  of  the  nature  of  a  3:7r-bound  state. 

(b)  Evidence  has  recently  been  reported^  for  an  /  =  0,  J=l  resonance 
(which  is  distinct  from  the  Stt  bound  state  discussed  in  (a))  in  a  study  of  the 
effective  mass  distribution  of  triplets  of  pions  in  the  reaction 

p  -\-  P  ->  71^  -{-  71^  -\-  n~  -\-  7l~ 

The  mass  of  the  resonance  is  787  MeV  with  a  half- width  7^/2  <  15  MeV  corre- 
sponding to  a  life -time  t  >  4  x  10  '^^  sec. 

(3)  The  K-  71  resonance 

Recently  a  study  of  the  reactions 

K-  +  p-^K-  +  Tt''  +  p       (a), 

-^K^  +  71-  +  p       (b) 

has  revealed  the  existence  ofsbK  —  Tt  resonance.^  The  branching  ratio  at  resonance 
for  the  two  channels  (a)  and  (b)  is  predicted  to  be  2  and  1/2  for  7=1/2  and  3/2 

1  W.  R.  Frazer  and  J.  R.  Fulco,  Phya.  Rev.,  117,  1609  (1960). 
-  Erwin  et  al.,  Phys.  Rev.,  Letters,  6,  628  (1961). 

^  A.  Abashian,  N.  E.  Booth  and  K.  M.  Crowe,  Phys.  Rev.  Letters,  5,  258  (1960);  iJee;.  Mod. 
Phys.,  33,  393  (1961). 

^  G.  F.  Chew,  Rev.  Mod.  Phys.,  33,  394,  (1961). 

5  Maglic  et  al.,  Phys.  Rev.  Letters,  7,  178  (1961);  Phys.  Rev.  125,  687  (1962). 

«  Alston  ei  al.,  Phys.  Rev.  Letters,  6,  300  (1961). 
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states  respectively.  The  observed  ratio  is  1.4  ±  0.4  indicating  the  /=  1/2  state 
for  the  resonance.  The  resonance  has  a  mass  of  885  ±  3  MeV  with  a  full  width 
at  half-maximum  of  16  MeV.  The  spin  of  the  resonance  is  not  as  yet  known 
although  the  isotropic  angular  distribution  of  the  K  and  tz  from  the  decay  of 
the  resonance  suggests  J  =  0  or  1 . 

(4)  The  A-7t  resonance 

This  resonance  was  first  observed  in  the   15  in.  Berkeley  hydrogen  bubble 
chamber  in  the  reaction 

K-  +  p  -^  A^  -i-  71^  +  Tr- 
ior incident  iiT'-momentum  of  1.15  BeV/c.^  The  resonance  has  been  christened 
Y*  and  clearly  has  isotopic  spin  1.  The  mass  of  the  resonance  is  1385  MeV  with 
a  half- width  of  about  25  MeV.  The  helium  chamber  collaboration  group  ^  have 
also  observed  it  in  the  reaction 

K-  +  He^  ^A^  +  71-  +  He". 

Their  analysis  of  the  angular  distribution  of  the  FJ*  decay  products  suggests 
Jrr.  1/2  for  the  resonance.  This  inference  however  needs  further  corroboration. f 

(5)  The  S  -7t  resonance 

This  resonance  has  been  observed  in  the  study  of  the  reactions  ^ 

K-  +  p^  E^  +  71-  +  71^  +  71-, 
->-  Z-  +  71^  +  71'^  +  71-, 

->  r^  +  7r«  +  7r+  +  71-, 

-^  A^  +  71^  +  71"-  +  71-. 

The  Q-value  of  the  final  E  —  7Z  system  shows  a  peaking  at  about  1400  MeV  in- 
dicative of  a  resonance.  The  corresponding  distribution  of  Q{A7i)  shows  no  such 
behaviour  suggesting  that  the  observed  E  —  7i  resonance  is  something  distinct 
from  7*.  (Notice  that  7*  cannot  decay  into  E^  +  7i^  since  E^  +  7i^  has  no  /  =  1 
component.)  The  relative  number  of  events  with  E^7i^  in  the  final  state  compared 
to  those  with  E^7i-  and  E-7t^  in  the  final  state  near  resonance  suggests  1=0 
for  the  resonance  which  has  therefore  been  called  TJ.  The  mass  of  the  resonance 
is  about  1400  MeV  with  a  half- width  of  about  20  MeV.  The  spin  of  the  resonance 
is  not  yet  known. 


^  Alston  et  al.,  Phys.  Rev.  Letters,  6,  520  (1960);  Berge  et  al.,  Phys.  Rev.  Letters,  6,  557 
(1961);  Dahl  et  al.,  Phys.  Rev.  Letters,  6,  142  (1961);  Helium  Chamber  Collaboration  Group, 
Nuovo  Cimento,  20,  724  (1961);  M.  H.  Alston  and  M.  Ferro-Luzzi,  Rev.  Mod.  Phys.,  33,  416, 
(1961). 

2  Alston  et  al,  Phys.  Rev.  Letters,  6,  698  (1961). 

t  There  is  now  evidence  that  J^  3/2  for  the  7*  ;  ref.  R.  P.  Ely  et  al.,  Phys.  Rev.  Lett.  7, 
461  (1961). 
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(6)  The  K  -  .V  resonance 

A  resonance  in  the  K~  —  p  system  with  a  mass  of  1810  MeV  has  been  reported 
recently.*  The  cross-section  curve  for  K~  —  n  scattering  reveals  no  corresponding 
structure  indicating  that  the  resonance  is  in  the  7  =  0  state. 

There  also  seems  to  be  some  evidence  for  a  resonance  in  the  K—p  system 
at  a  centre-of-mass  energy  of  about  95  MeV. 


13.  GAUGE  THEORIES  OF  ELEMENTARY  PARTICLES 

While  Heisenberg's  theory  attempts  to  be  very  comprehensive  in  its  scope 
and  is  a  departure  from  the  conventional  field  theories,  there  is  still  a  belief 
that  a  dynamical  theory  of  elementary  particles  is  possible  within  the  present 
framework.  Of  particular  interest  in  this  connection  are  the  attempts  of  Gell- 
Mann^,  Neeman^,  Sakurai*  and  Salam  and  Ward^  which  are  in  a  sense  sequels 
to  the  work  of  Yang  and  Mills.* 

Actually  all  these  stem  from  the  weU-known  principle  in  electrodynamics 
(already  referred  to  in  Chapter  XX)  which  states  that  the  existence  of  the  electro- 
magnetic field  is  a  necessary  consequence  of  demanding  the  invariance  of  the 
Lagrangian  under  gauge  transformations  of  the  first  kind  on  the  electron  field : 

V)(a:)  =  e*«^(^)y)(a;).  (1) 

With  constant  A,  {I)  merely  expresses  the  fact  that  charge  is  conserved  in  these 
interactions.  When  yl  is  a  function  of  x,  the  transformation  acquires  a  deep  and 
beautiful  significance.  The  free  Lagrangian 

L=  -if){iy^d^+m)\p  (2) 

(where  we  have  written  y^^^  for  y)  is  no  longer  invariant  under  (1)  and  we  are 
constrained  to  add  to  it  a  term 

Unt=  -ey>y^y)A^  (3) 

where  ^^  is  a  new  field  which  may  be  identified  with  the  electromagnetic  field 
and  which  transforms  as 

A^(x)-^A^(x)  +  ^  (4) 

when  y){x)  transforms  as  in  (1).  The  transformation  (4)  is  called  a  gauge  trans- 
formation of  the  second  kind  and  implies  that  the  electromagnetic  field  is  massless. 
In  this  sense  then,  the  existence  of  the  electromagnetic  field  is  a  consequence  of 


1  L.  T.  Kerth,  Rev.  Mod.  Phys.,  33,  389,  (1961). 

-  M.  Gell-Mann,  Phys.  Rev.  (to  be  published). 

^  Y.  Neeman,  Nuclear  Physics,  26,  222  (1961). 

^  J.  J.  Sakurai,  Ann.  Phys.,  11,  1  (1960). 

^  A.  Salam  and  J.  C.  Ward,  Nuovo  Cimento,  19,  165  (1961);  20,  419  (1961). 

«  C.  N.  Yang  and  R.  L.  Mills,  Phys.  Rev.,  96,  191  (1954). 
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the  conservation  of  charge.  It  is  to  be  emphasized  that  in  variance  under  (1) 
implies  that  the  phase  of  the  charged  field  can  be  altered  from  one  space  time 
point  to  another  arbitrarily,  i.e.  the  local  phase  of  the  field  is  not  a  quantity  of 
significance.  Thus  when  (1)  is  a  symmetry  of  the  system,  the  notion  of  a  local 
field  acquires  a  deeper  significance  than  what  is  conventionally  attributed  to  it. 

It  may  be  recognized  that  what  has  been  done  here  to  generate  the  electro- 
magnetic field  is  to  replace  the  ordinary  derivative  d^  in  the  free  Lagrangian 
b}^  the  CO  variant  derivative^  S^^  ieA^  corresponding  to  the  space-time  dependent 
transformation  (1).  A  leading  attempt  to  extend  this  idea  to  other  conservation 
laws  as  well  was  that  of  Yang  and  Mills  who  by  the  localization  of  isotopic 
spin  rotations  "derived"  the  existence  of  a  triplet  of  vector  mesons  coupled 
to  the  isotopic  vector  current.  Sakurai  subsequently  tried  to  build  a  theory 
of  strong  interactions  by  localization  of  the  gauges  belonging  to  the  three  con- 
servation laws  peculiar  to  these  interactions,  viz.  baryon  number,  hypercharge 
and  isotopic  spin.  In  the  latter  theory,  it  is  essential  that  there  be  only  two 
fundamental  elementary  fields  apart  from  the  vector  mesons.  The  chief  difficult}^ 
in  these  theories  is  that  they  give  vector  mesons  of  zero  mass  in  analogy  with 
electromagnetism  (except  possibly  for  those  associated  with  isotopic  spin 
rotations)^  for  which  no  experimental  evidence  exists. 

Theories  generated  by  such  gauge  transformations  have  their  interaction 
forms  almost  uniquely  fixed.  Thus  if  the  symmetry  property  is  expressed  by^ 

W'=[l-\-iIa]'F  (5) 

where  /  is  the  generator  of  the  transformation,  the  co variant  derivative  is 

F,^e^-iIA^.  (6) 

Thus  the  problem  of  finding  the  correct  interactions  is  reduced  to  one  of 
finding  the  correct  symmetry  properties.  For  instance,  let  us  assume  that  a 
scalar  a  particle  exists  and  write  the  kinetic  energy  part  of  the  free  Lagrangian  as 

Lke=  N,^iYA^L+  Nniy^d,Nn+  i(6,^)'+  i{6,or  (7) 

where 

NL-^i{l  +  y,)N;     Nji=  iil- y,)N  (8) 

and  N  and  :7r  are  the  nucleon  and  pion  fields.  The  widest  possible  symmetries 
of  this  Lagrangian  are  those  contained  in  the  four-dimensional  rotation  group 
with  {Nj^ ,  iVjj)  and  (ct,  a)  forming  four  vectors  (cf.  F.  Gursey's  work  discussed 
in  Chapter  7,  Sec.  3).  Correspondingly  we  have  six  vector  fields  U  and  V  in  the 
covariant  derivatives  and 

i^int  =  [Niy^y.rN  +  {d,o)  jt  -  a(a^jr)]  F^ 

+  [Niy^rN  -  ^{d^jt  x  Jt  -  Ji  x  d^jt)]  U^  (9) 

+  i{jtx  U^  +  oV^r+i{jt-V^r 

^  For  the  definition  of  covariant  derivative  see,  for  example,  A.  Einstein,  The  Meaning 
of  Relativity,  Methuen  (1946),  p.  69* 

2  A.  Salam,  Rev.  Mod.  Phys.,  33,  426  (1961). 
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We  notice  that  the  fermion  mass  term  is  not  invariant  under  this  group,  whicli 
is  a  very  unsatisfactory  feature.  There  has  also  been  a  detailed  investigation  of 
other  symmetry  groups  in  this  connection.  We  present  below  Salam  and  Ward's 
considerations  (ref.  6,  p.  548)  on  the  unitarj^  group  in  three  dimensions  where 
the  elementary  fields  are  taken  to  he  p,  n  and  A  as  in  the  Sakata  model.  Let 
P 

.  The  kinetic  energy  part  of  the  Lagrangian  then  reads 


.t*  = 


The  unitary  transformation  is 


x->[l  +  iH]x 


(10) 

(11) 


where  H  =  oigH^  and  od^s  are  infinitesimal.  There  are  eight  spurless  matrices  H^ 
corresponding  to  the  fact  that  this  is  an  eight-parameter  group.  A  representation 
of  H  is  generated  by 

pp      pn      nA 
H'  =  xx=^      rip       nn       nA    .  (12) 

Ap     An     AA 


This  is  not  irreducible  since  it  has  non-vanishing  spur.  The  matrix 


H  =  H'  -^      Spur     H'  X  I 


(13) 


is  therefore  irreducible.  The  co variant  derivative  is  defined  through 


p^  =  a^-iF^ 


where  H^  is  obtained  from  H  by  the  identifications  of    Py  +  ^^^-2^^    with 

a  vector  meson  ^°  with  /  =  0,  PP-^^  with  the  neutral  component  7i°  of  a  triplet 
of  vector  mesons  etc.  Thus  ' 


^,.= 


2_  o  ,  _L,» 

C6  ^''  "^  C2     " 


Til 


n: 


ye 


Q, 


1     -jr" 


K: 


K: 


K- 


Kl 


,■6 


e;; 


(15) 


where  jr^,   and   K^  are   vector  mesons   with  /-spin   and  strangeness  identical 
with  those  of  n  and  K  and  ^°  has  7=0.  The  Sakata  Lagrangian  now  reads 


+  mass  terms  for  p,  n,  A  and  H  fields. 


(16) 
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The  TT.f/s  occurring  here  correspond  to  the  Yang-Mills  triplet  of  bosons  (ref.  5, 
p.  548)  and  may  be  the  1=1,  J=l,jz  —  jt  resonance  which  has  been  recently 
observed.^  Similarly  the  K^^  and  q^  may  also  be  the  observed  K  —  n"^  and  Stt^ 
resonances. 

Pseudovector  currents  may  be  generated  by  considering  transformations  on 
Xi^  J(l  +  y^)x  and  Xj^^  \{\  —  y^)x  separately: 


xl-^{\  +iH^)xL, 


^ij->(l  -iH2)xR 


(17) 


since  Nf^  =  \{Hi^  +  H^^i)  and  H'^^  =  \{H-^^  —  H^^)  are  vector  and  pseudovector 


respectively.  It  is  possible  that  the  components  of  the  divergence  -r~/     which  is 

not  zero  because  of  the  presence  of  mass  terms  in  the  Lagrangian,  may  be  the 
71  and  K  fields  themselves. 

Weak  interactions  can  also  be  considered  in  this  scheme.  An  irreducible  re- 
presentation of  the  unitary  group  is  provided  by 


B 


ph° 

b 

h     ~ 

6* 

1    ^o 
V2^ 

^'• 

6* 

1     ho 

^-j 

(18) 


where  h^  is  real.  A  gauge  transformation  of  the  type  a;^-^(l  +  iB)xj^  would 
lead  to  a  weak  interaction  of  the  type 

Lint  =  iVLy,inj,  +  plYi^AlW^  + 

+  -^  V-^VLy^^nL  +  niy^TiL  +  ALyf^AL  +  r?xy^,yli  +  ALy^ni]  J5°     (19) 


we  may  consider  the  triplet 


for  leptons  to  define 


Analogous  to 
the  gauge. 

Other  considerations  of  a'  similar  nature  are  those  of  Neemann^  and  of 
Gell-Mann^  referred  to  before.  From  the  latter's  considerations,  there  emerges 
a  theory  of  weak  interactions  and  a  picture  of  strongly  interacting  particles  which 
is  rather  unified.  Since  the  basic  fields  are  those  of  the  Sakata  model,  the  theory 

inevitably  leads  to  the  well-known  |zl  /|  =  "2-  and  -^^  =  +  2  rules  for  strangeness- 

non- conserving  currents  in  weak  interactions.  Any  evidence  that  the  latter  is 
wrong  may  then  suggest  that  the  group  has  to  be  widened  still  further  to  take 
these  experimental  facts  into  account. 


1  Erwin  et  ah,  Phys.  Rev.  Letters,  6,  628  (1961). 
-  Alston  et  ah,  Phys.  Rev.  Letters,  6,  300  (1961). 

^  A.  Abashtan,  N.  E.  Booth  and  K.  M.  Crowe,  Phys.  Rev.  Letters,  5,  258  (1960) ;  Rev.  Mod. 
P%s.,  33,  393(1961). 
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14.  THE  PRINCIPLE  OF  EQUIVALENCE  FOR  ALL  STRONGLY 
INTERACTING  PARTICLES 

The  principle  of  equivalence^  states  that  the  correct  theory  should  not  allow 
us  to  decide  which  particles  are  elementary. 

The  mathematical  framework  in  which  this  principle  has  been  expressed  by 
Chew  and  Frautschi  uses  the  analytic  continuation^  of  the  /S-matrix  as  a  function 
of  the  complex  energy  E  and  angular  momentum  J,  which  has  been  shown  to 
be  possible  for  potential  scattering  and  for  any  amplitude  obeying  the  Mandel- 
stam  representation. 

In  the  union  of  the  complex  J  and  E  planes,  the  locus  (x{E)  of  a  single  pole 
in  the  J  plane  (as  E  varies)  is  an  analytic  function  of  E.  If  Rea  [E)  passes  through 

a  positive  integer  or  zero,  with  ^(Rea)  >  0,  then  for  this  integer,  there  occurs  a 

physical  resonance  or  bound  state.  To  each  locus  (Xi{E),  therefore,  corresponds 
a  family  of  poles  with  different  {J,E);  these  poles  are  called  "Regge  poles". 
The  principle  of  equivalence  may  be  satisfied  by  postulating  that  all  the  poles 
of  the  iS-matrix  are  Regge  poles;  this  is  an  extension  to  angular  momentum 
of  the  ''principle  of  maximum  analyticity." 

Poles  corresponding  to  elementary  particles  can  arise  if,  for  certain  internal 
quantum  numbers,  the  relativistic  continuation  in  J  is  restricted  to  a  smaller 
region  than  that  for  the  non-relativistic  case  and  the  region  Re  J  <  JmiA^)^  J  mm 
{E)  >  0,  is  excluded  from  the  domain  of  the  J,  E  space  into  which  the  /S-matrix 
can  be  continued. 

The  question  whether  or  not  all  poles  of  the  /S-matrix  are  Regge  poles,  which 
will  decide  whether  or  not  the  conventional  (Lagrangian)  field  theory  is  correct, 
may  be  decided  experimentally.  Regarding  a  (^)  as  (x{s),  s  =  E^,  one  may  either 
(i)  work  in  positive  s  and  look  for  families  of  particles  (corresponding  to  families 
of  Regge  poles),  or  (ii)  work  in  negative  s  and  trace  out(x(s)  for  a  range  of  5  from 
the  asymptotic  energy  variation  of  the  backward  peak  in  the  crossed  channel. 
The  width  and  tail  of  the  backward  peak  vary  differently  according  as  the  scatter- 
ing is  controlled  by  a  Regge  pole  or  an  elementary  particle  pole. 

The  quantum  numbers  of  the  vacuum  inevitably  receive  special  emphasis; 
a  proposal  is  that  all  forward  diffraction  peaks  may  be  controlled  by  a  Regge 
pole  with  the  internal  quantum  numbers  of  the  vacuum. 

Better  high-energy  experiments  may  decide  whether  the  principle  of  maximum 
analyticity  in  J  is  valid. 


1  G.  F.  Chew  and  S.  C.  Frautschi,  Phys.  Rev.,  Letters,  7,  304  (1961);  8,  41  (1962). 

2  T.  Regge,  Nuovo  Cimento,  14,  951  (1959);  Nouvo  Cimento,  18,  947  (1960). 
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